Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0938 |
Symbol | |
ID | 4245677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 1474632 |
End bp | 1477691 |
Gene Length | 3060 bp |
Protein Length | 1019 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638106193 |
Product | hypothetical protein |
Protein accession | YP_720805 |
Protein GI | 113474744 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.391922 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAC ATATTATTTA TCAATTAAAA TGTAGGTTGA TTAAAGCATT TAAAAGAATC AAATCAATAC ATATATTAAT AATATTATTT TTTTCATTTG TATGCTGGGA AATACTTGGA TTTTCAACTA ATATATTAGC TGATTTTTTA TGGTTTCAAG AGCTGAATTA TTTGCCTGTA TTAATAAATA AACTGCAAAC AGAAACTTGG CTTTGGATAA CAACATTTTT AATTAGTATG GGCTTTTTTT TAGTGAATTT GAGATTAGCC AGCTTTTTTA AATATTCAAA AAAACAAGTT CGGATAACTG AAAGATCTGA AGAAATTATG CTAATTCCCC CAGTTACTAT ACCGTCAAGT AAACTACGAA TTGAGCCGTC ACTAAGTCTA AGCTGGTTGT TATGTTGTAT ATTTGGATTA ATTTTATTAG TAGGATTAAT TTTAACCCAT TATATTGACG TATTTACTAA TTACTTGTAT CCTGATTTAA CAGTTGCTAA TGTATCTCCT CAAATCCCAT CAGAATTTAA CATCGAATCA ATTTGTAAAA TACTCACCTC AATACTATCT AATTTGTGGT TATTAGGATT ATTTCTTCTA TTATCTTTTG CTATAATTAT TAATCCTATA CTCTGGTTAA GTGTATTTGC AGTAGTTCTC AGTTTAGTTT TTAGCTTTAT TCTTTCCAGC CATTGGGCAA ATATATTACA GCTGTTGCAT GGGACTCCCT TTAATAAAAG TGAAGATTTA TTTCATATAG ATATAAGCTT TTATGTTTTT CAACTCCCCG TTTTAGAGCT ATTAAGATTT TGGTTAATTG GATTATTTCT ATATGGATTT GTTGCTTGTA TTTTGATATA TTTATTATCA GGAAAAAGTT TAAGCCAAGG AAATTTTTAT CAATTTTCTC AACAGCAAGA AAAGCATCTT CACGGTTTAG GTGGAGGTTT TATATTAACC ATAGCATTTA GTTACTTTAT AGCCTGTTTT GAGTTACTTT ACTCTCGCCG TGGTGTAGTT TATGGTGCCG GTTATACCGA TATAAAAGTT CAGCTTCCAG CATATGTATT TTTAGGGATT TTGGCGTTAC TAATTGCATT TTTTCTATTT TGGCAAGCAA TTTTTTCAGT CAAAAGTATT CAGTCTTATA TTGAGGCAAG TTTGTGGTTT TTACGTTTAG GTCGTAAGAG AAAAAGAAAG AAAAAAGTTA TTGCTAAACT ATTCGCTAAT AGCTATTCAT TAAGAGCAAT TTTGACATGG TATTTAATAA TAGCAGTAAT TGCTGGTTGG TTAATACCAA AAATTGTACA AATGGCAATT GTCCAACCTA ATGAGATAGA ACGAGAAATT CCTTATATTA AACGTAGCAT TACCTTTACT AAAGAAGCTT ATATTGATGT AGATAAATTA GAAGTAGAAT TATTCGACCC AAATAATGAG CTTACCTATG ATGACTTAAT AAATAATAAG TTAATCATTG AGAATATTCG TCTTTGGGAT ACAAGACCAA TTTTACAAAC GAATCGTCAA TTGCAACAAA TTAGACCCTA CTATGAATTT ATAAATGCTG ATATTGATCG TTATACATTT CTGAAAAAAG AGTCAGAAAG AACAAAAAAT AATCTTACTA AAAAACAACA AGTAATTATA GCTGCTAGAG AACTAAACTA CGAATCTGTA CCTCAGCCAG CACAGACTTG GGTCAATGAA CATTTAGTTT ATACTCATGG CTATGGTTTT ACTCTTTCTC CAATTAATCA AGTTGAAAAA AATGGATTAC CAGAATATTT TGTGAAAAAT ATTGGGCCAG ATCCTACTTT GGAAAAGAAT AGCACTTTAG AAGTATTAAA CAGGATTAGA GACAGCATTC CCATCGGTAA ACCGAGAATT TATTATGGAG AACTTACTAA TACTAATATT ATGACTTCTA CTGCACAAAG AAATAAAGAA TTAGATTACC CTAGTGGAGA AGCGAACTCT TATAACACTT ATGATGGAAG TGGAGGAATT GTTATTGGTC AAGGGTGGCA AAGATGGATA TTTGCTAAAT ATCTTAAAGA CTGGAAAATG TTATTAACTA ATGAATTTAT ACCTGAAACA AAACTATTAT ATCGTCGTAA TATTAATGCT AGAGTCCGAA GTATAGCTCC ATTTCTACGT TATGATCATG ACCCTTATTT AGTGGTGGCT GACCCTAACT TTGGTCATAA AAATATGAAT CAAAAAAATC CTAATTATCT ATACTGGATT ATTGATGCTT ATACTACGAC TAATCACTAC CCTTATTCTG ACCCAGAAAA TAATGAGTTT AACTATATTC GTAATTCAGT AAAAGTTGTA ATTGATGCCT ATAATGGTTC AGTAAAATTC TATGTTGCTG ACCCAAAAGA CCCTATTATT AGAACCTGGA AAAAAGCATT TTCAGATATG TTTAATTCCA TTGAAGAAAT GCCAACTAGT CTTTATACTC ATATCCGCTA TCCACTAGAT TTATTTCAAG TACAATCTGA AGTTTTGTCA ACTTATCATA TGGATGACCC TCGTGTATTT TATAATCGGG AAGACTTGTG GCGGGTTCCA ATTGAGATTT ATGGGGCTCA ACAACAAAAA GTCAAACCTT ATTATCTAAT CACACAATTA CCAACAGAAA CTTCAGAAGA ATTCATTTTA CTTCTACCTT ATACTCCAGC AAGTCGTAAT AATTTAATTG CTTGGTTAGC AGCAAGATCG GATGGGGAAA ATTATGGTAA GTTACTGTTA TATCAATTCC CTAAACAACG ATTAATATAT GGTATAGAAC AAATTGAAGC TTTGATTAAT CAAGACCCAG AAATATCCCA GCAAATTTCT CTTTGGAATC GTCAAGGTTC AAAAGCAATT AAAGGGAATT TATTAGTAAT TCCAATTAAT GAATCTCTGA TTTATGTTGA GCCTATTTAT TTAGAAGCAG AGCAAAATAG TTTGCCAACT TTAAGAAGAG TAATTGTTTC TTATAAAAAC CGAGTTGTTA TGAAGCCTAC TCTTGATGAA GCACTTCAGG AGGTTTTTCA AATACAATAA
|
Protein sequence | MKIHIIYQLK CRLIKAFKRI KSIHILIILF FSFVCWEILG FSTNILADFL WFQELNYLPV LINKLQTETW LWITTFLISM GFFLVNLRLA SFFKYSKKQV RITERSEEIM LIPPVTIPSS KLRIEPSLSL SWLLCCIFGL ILLVGLILTH YIDVFTNYLY PDLTVANVSP QIPSEFNIES ICKILTSILS NLWLLGLFLL LSFAIIINPI LWLSVFAVVL SLVFSFILSS HWANILQLLH GTPFNKSEDL FHIDISFYVF QLPVLELLRF WLIGLFLYGF VACILIYLLS GKSLSQGNFY QFSQQQEKHL HGLGGGFILT IAFSYFIACF ELLYSRRGVV YGAGYTDIKV QLPAYVFLGI LALLIAFFLF WQAIFSVKSI QSYIEASLWF LRLGRKRKRK KKVIAKLFAN SYSLRAILTW YLIIAVIAGW LIPKIVQMAI VQPNEIEREI PYIKRSITFT KEAYIDVDKL EVELFDPNNE LTYDDLINNK LIIENIRLWD TRPILQTNRQ LQQIRPYYEF INADIDRYTF LKKESERTKN NLTKKQQVII AARELNYESV PQPAQTWVNE HLVYTHGYGF TLSPINQVEK NGLPEYFVKN IGPDPTLEKN STLEVLNRIR DSIPIGKPRI YYGELTNTNI MTSTAQRNKE LDYPSGEANS YNTYDGSGGI VIGQGWQRWI FAKYLKDWKM LLTNEFIPET KLLYRRNINA RVRSIAPFLR YDHDPYLVVA DPNFGHKNMN QKNPNYLYWI IDAYTTTNHY PYSDPENNEF NYIRNSVKVV IDAYNGSVKF YVADPKDPII RTWKKAFSDM FNSIEEMPTS LYTHIRYPLD LFQVQSEVLS TYHMDDPRVF YNREDLWRVP IEIYGAQQQK VKPYYLITQL PTETSEEFIL LLPYTPASRN NLIAWLAARS DGENYGKLLL YQFPKQRLIY GIEQIEALIN QDPEISQQIS LWNRQGSKAI KGNLLVIPIN ESLIYVEPIY LEAEQNSLPT LRRVIVSYKN RVVMKPTLDE ALQEVFQIQ
|
| |