Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44473 |
Symbol | |
ID | 7197756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 676155 |
End bp | 678173 |
Gene Length | 2019 bp |
Protein Length | 652 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178554 |
Protein GI | 219115517 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGCCTTTGTG TTCCCGGACA GTGAGTGAGC CACCCCCTTC TCCCACCATC ACGCTGAATC ATGACTGGTC CGACACCGCA GGATGAAATT GATCGACAGC ACGCACAAGC AAACAAGTTG GTGGAGCAGT CGCAACCTAC ACCAGCTGTT CGTGTGTCGG ACGGCAAGGA AATCGCTCTG CGCTACAAGA CTCGTCCCGT AGACGACCTT CTAGCACGGG GGAGCGCCAC CTTGATCGAA TCTGCAACGA ATCCGCAAGA CGACGACGAT CGGGAAGACT GGGTGGTCGT CGATGAACAC TTTCCCACTT GGAACACGCG TGGTTTTGTC GATGTGACGG CCTTGCAGGA CTTGGTTCGC GAAGGATACA ATCAGACCGT CCCGGTGATA CCGATGTCGG TAACAAAAAC CACGACGACG ACAGCAACAG AAAGCACGAC GACAGGGTCG CCCAATCCAT CCACTTCCGG TTCCCAAAAG CGCAAAACGT TTCAAGCCTC CGACGTTAAA TCCATCACCA ATTTATGGAG CGAACCCAAC GCCGCCGTCC ACAACGTCGC CATCTGCCGT CCCTCGCACG ATGCCTGGGG TATCAACAAA ATCGTTCTCG TCTTTTGCGA CGACTTTTTG CGCGATATTT ACGAACTGCC TTGGTGGCAC GGGCGCGATG ACATGCGGTT GGCCGTACAA CCGATTCTGG ACGTTTTGCG CATTCCACCG CAGCGGATCG TCCGCATGCT GCTGGCGTCC TTGCCACCTG GGGTGACCAT TCCCGTCCAC CACGATACGG GAGAATGGGT CCGACACACA CATCGCGTCC ACGTACCGGT TCTGGTGCAG AACCCTGATC GGGTCGTCTT TCAATGCGGC GTAGCCCTGG ATTCCCTCCA GCGCATCCCC TGTACACCTG GTCACGTGTT TGAAATCAAC AACCAGGCGA AGCACGCCGT ATCCAACTGC GACACTGATC ACCGGGTGCA TTTGATTCTC GATTACGTTG ACGCCGACTT TCCCTTGCTC CCCCGGATTC TTCTCCGACC CGGTGAAAAA CTTCTCCAAA CACGGCGGTC GATTGATCGA TTGGAGAATC GGGGGTCGCG TCCCACACCT TCCTTTTTAA TCCTCGGAGC GCAAAAAGCT GGGACCACAT CGCTCTACGA ATACATCGTC CAGCATCCTT TGGTGGTTCC TGCACGCCGT CGGGAAACTC ACTGTTTGGA TTGGCGCTGG AACGATAAAC TCAAGTCCGT CAAAGCGCAA CGCGCTTGGT GTCACAAGTT TTACCTCACC CAGGAACTGT CTCTCCATCC GTCGTGTTTA ACAGGCGATT CCACACCATC CTACTTGCTC GATAGCCGTC GGGTAATCCC ACGCATTCGC AACATTTTCG ACTGGCCCCT CAAGTTTTTC GTAATGTTAC GGAATCCCAT CCGGCGGGCG GAGTCACATT TTGCCATGGT TACCAGTTCA GAGGGTACAC CAGCGCAGCT AAAGGCTCGA GGCTCTGAAT GGCGAAACAA AACGTTCCGA GAGGTAGTGC ACGACGATAT GCGCACGATG CAAACGCATG GACTGATCCC GTACTGGAAT GTTGACGACG GTGTTTTGGA TATAGCATGT TTTGATACGT TTGTGGGTAG CCCAGAAGAA GACGCGGCGT ACGATAGGTA TCTAGCACAG GTCCCGTTAC ACACGGGATC CCACAGTCTG ATCAGCCGCG GATTGTACGA ACTACAACTT CGACCCTGGT TCGTGGCGTT TGACCCCGCA GCTTTTCTGG TTATGAAGCT CGAGCATTTT CGAGAACGCG GCGTAACAAC TGCTATGGAA GCCGTTTGGA AGCATTTGGA TTTGCCATGC GTTTCGATAC AGAATGAAGA AGCCAAAAAC GTTCGTTCTT ACGATCCGAT CGCCGATGAC TCGATGCAAG TCTACTTGGA ACGCTTCTAC GCACCGCACA ATGAGCGCCT GGTGAGTCTT TTGGGCGACG ATGCGTGGAA TCAAGCCTGG TGTCCATAG
|
Protein sequence | MTGPTPQDEI DRQHAQANKL VEQSQPTPAV RVSDGKEIAL RYKTRPVDDL LARGSATLIE SATNPQDDDD REDWVVVDEH FPTWNTRGFV DVTALQDLVR EGYNQTVPVI PMSVTKTTTT TATESTTTGS PNPSTSGSQK RKTFQASDVK SITNLWSEPN AAVHNVAICR PSHDAWGINK IVLVFCDDFL RDIYELPWWH GRDDMRLAVQ PILDVLRIPP QRIVRMLLAS LPPGVTIPVH HDTGEWVRHT HRVHVPVLVQ NPDRVVFQCG VALDSLQRIP CTPGHVFEIN NQAKHAVSNC DTDHRVHLIL DYVDADFPLL PRILLRPGEK LLQTRRSIDR LENRGSRPTP SFLILGAQKA GTTSLYEYIV QHPLVVPARR RETHCLDWRW NDKLKSVKAQ RAWCHKFYLT QELSLHPSCL TGDSTPSYLL DSRRVIPRIR NIFDWPLKFF VMLRNPIRRA ESHFAMVTSS EGTPAQLKAR GSEWRNKTFR EVVHDDMRTM QTHGLIPYWN VDDGVLDIAC FDTFVGSPEE DAAYDRYLAQ VPLHTGSHSL ISRGLYELQL RPWFVAFDPA AFLVMKLEHF RERGVTTAME AVWKHLDLPC VSIQNEEAKN VRSYDPIADD SMQVYLERFY APHNERLVSL LGDDAWNQAW CP
|
| |