Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3205 |
Symbol | |
ID | 4243800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4899215 |
End bp | 4901035 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638108207 |
Product | hypothetical protein |
Protein accession | YP_722798 |
Protein GI | 113476737 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.243951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0338261 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCATAG AGATATTAAC CCTGTTTGCG GGTGCAGGAC TAGAACAAGC AACCAGTGTC ACATGGCATT GGGCAAGTAA ATGGGCAGCA AAGAAGGCTG ATGAAAAGAG TCAGGTAATA CAGACAGCAT TACTAACTTC ATATTGTAAA GCTTTGCAAG TAATTCGTAA AGAGTCTCAA CAACTACCTC AATGGAAACA AGAGAAAGCT GAAATTCAAG CTAGATTAAC AGAAATGGAG AAAGATGCAC TCATGATATT AGATTTAGCA CAGACACAGA AGCATAGAAT GACTAGTATC CCTCAAGGTG GAGAAGAACA AGCGATCACA AAAACAGTAA TGTCTAGACT GCAAAGTAAA GATAAATGGC TCAAACATGC TCCAAAAGAA TTATCTTATT TAGTTGAAAC TAGATTAGTA TCAATTACAG TGCATCAGTT TCGCCACGAA CTGAGAACAG AAAGCCGGGT ATTTCAAGCC TTAGTCCACG ATATGTTAGC TACAGCAGGA GAAGATGTTA GTCGTTTGCT TGATGTCACT AATAGAATAG AGAAAAGAGT AGATCAATTT GTGGGAGACT ACCGTCAAGA TATTGACAGA ATTATTGTAG GTATTGGAGG AGCTAACTCC CACCTAGAAC AGTTAATATC ACTGTTAAGC AGACAGATGA AGCAATCTCA AGCTATACCT GCTGACTTTG CTGCTCTGAT AGCGGATAAA ACAGAAGGAT TTCTCGGACG AGGTTTTGTC TTTGACACGA TAGAGAATTT TATCCAAAAT CAATCCAAAG GATATTTGAT CATTGAAGCA GATCCAGGAG TAGGTAAAAG TACAATTATA GCCGAATATG TACGTCGCAC AGGTTGTATT GTCTATTTTA ATTTATTATC AGAAGGCAGA ACTAGGGCTG AAGATTTTCT TAAGAGTGTG TGTACCCAAT TAATTGAACG TTATAATCTT CCTTATGAAT TACCATTACA TCCAGATAAT ACCAGCAATG GGAACTTTCT GTCAAAACTA CTGAATGAAA TTAGTATCCA ATTACTCCCT AAAGAAAAAC TTATAATAGC AGTGGATGCT TTGGATGAAG TAGATTTAAG TAGTCAGAGT GGAGCAGCTA ATGTCCTATA TTTACCAGCA AACTTACCTA ATAGAGTATA TTTTCTGCTT ACTAAGAGAT CGGATCCACT ACCATTAGTA GTGTCGGCGC CACAGAAAAT TTTTGATTTA ATGATCTATC CTAATGAAAG TTTGAAAGAT GTTAAACTAT TTATTTACCT AAGAACAAAG CGAACTACAG TTCAAGCATG GATTAAAACT AGAGGATTAA CATTAGAAAA ATTTGTCGAA TCTATAGCTA GCAAAAGCGA AAATAATTTC ATGTATTTAA AATATGTACT GGATGATATT GAGCAAGGCA GCTATAGAGA TATCAGCTTA GAAAGTTTAC CCCAAGGCTT AGAGAAATAT TATGATCAAC ATTGGCATAG AATGAAAATG AATGTTAAAC CATTGCCAAT TACTAAACTA AAAATTATAT ATTTCTTAAC AAAAACTCGT AAGCCTGTAT CCTGTGGAAT GTTAGCTAAA TTTAGTGGAG AAATATCTTT GACAGTTCAA GAAATATTGG ATGAATGGGC GCAATTTCTG CGGATTCATC AAACAGAAGG CAAGCCGGTG TATAGCATAT ATCATGCTGC TTTTCAAGAT TTTCTCTACC GTAAAGATAT TGTTCAGGAA GTAGAATCGT TAGTTGAAAT TGAAAGTATG GAAAAACAAA TCAAAAATAA TTTGTTAGGA ATGGTTTATA AAAATGGTTA G
|
Protein sequence | MLIEILTLFA GAGLEQATSV TWHWASKWAA KKADEKSQVI QTALLTSYCK ALQVIRKESQ QLPQWKQEKA EIQARLTEME KDALMILDLA QTQKHRMTSI PQGGEEQAIT KTVMSRLQSK DKWLKHAPKE LSYLVETRLV SITVHQFRHE LRTESRVFQA LVHDMLATAG EDVSRLLDVT NRIEKRVDQF VGDYRQDIDR IIVGIGGANS HLEQLISLLS RQMKQSQAIP ADFAALIADK TEGFLGRGFV FDTIENFIQN QSKGYLIIEA DPGVGKSTII AEYVRRTGCI VYFNLLSEGR TRAEDFLKSV CTQLIERYNL PYELPLHPDN TSNGNFLSKL LNEISIQLLP KEKLIIAVDA LDEVDLSSQS GAANVLYLPA NLPNRVYFLL TKRSDPLPLV VSAPQKIFDL MIYPNESLKD VKLFIYLRTK RTTVQAWIKT RGLTLEKFVE SIASKSENNF MYLKYVLDDI EQGSYRDISL ESLPQGLEKY YDQHWHRMKM NVKPLPITKL KIIYFLTKTR KPVSCGMLAK FSGEISLTVQ EILDEWAQFL RIHQTEGKPV YSIYHAAFQD FLYRKDIVQE VESLVEIESM EKQIKNNLLG MVYKNG
|
| |