Gene Tery_2222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2222 
Symbol 
ID4243256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3464417 
End bp3465613 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content33% 
IMG OID638107324 
Producthypothetical protein 
Protein accessionYP_721924 
Protein GI113475863 
COG category[S] Function unknown 
COG ID[COG4370] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03492] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.917387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0837623 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAA AAAATATACT TTTTTTAAGT AATGGTCATG GAGAAGATGC CCATAATTGT 
CAAATTATTA AAGCTTTTAC AAAAATTTCT CCAGATACAA ATATATCAGC TTTACCTATT
GTTGGTGTTG GTAATAGTTA TGAAAATTTG AATATACCAA TTATTGGTCC TCGAGTAAAT
ATGCCATCAG GAGGATTTTT ATATCTCAGT CCTTTATTAT TATTTGAAGA TTTGGGAAAA
GGTTTAATTA GTTTAACTTG GCAGAAGTTA CAAACTATTT GGACATTTGC GAAAAACTGT
GATTTAATTA TGGCTACTGG AGATATTGTA GTGGCAGCTA TGGCTTATTC GACAAGGCTT
CCTTATATGA TATTTCTTTC GGCTGATTCT AGTTATTATG AAGGTCGGAT TAATTTGGGT
TTAATATTGC CAAAGTTACT TCATAATTCT CGATGTTTAA AGGTTTTTGC TAGGGATGCT
TTGACGGCTA AAGATTTAAA AAGACAAGGA GTTACAAAAA CAGAATTTGT TGGTACTCCA
GTGATGGATA ATTTAATTTC AACTGGAAAA AATTTACGGC TTAAAACGGA ATTGTTTACT
ATTGCTATTT TGCCTGGTTC TCGGTTGCCG GAAGCTGGTA AAAATTTATG TTTGCTGTTG
AAACTGGTTA GAGAAATTGT CAAAGTTATG GGAGTAAATG TTTGTCAGTT TCGAGCTGCA
ATTGTTCCTA TTTTAATGTT TGAATTAGAG GCGATCGCTA TTTCTGAAGG TTGGGAATGT
CAAGGAAGTA AGCTAACATT TTTTACTCAG GAATATACAA TAGAAGTAAT TTGTTATGAG
GATGCTTTTG CAGATATTTT ACAACATTCA AGTTTGGTAA TTGGTATGGC TGGAACTGCA
ATAGAACAAG CTGTGGGTTT AGGCAAACCT GTAATTACTA TTCCTGGTGA AGGTCCTTCA
TTTACCTATC GTTTTGCGGA AGCTCAAACT AGACTTTTAG GTTCTTCTGT ACAGGTTATT
GGTAAAAGAA TGGCTAATAG TTTTATTCTC CAAGAAGCAG CTAGAAAAGT TAAAGAAATT
TTGGCAGATG AAGAGTATTT ACAAAGTTGC ATTAATAATG GTTTAGAAAG GATGGGGAAG
CCTGGTGCTA GTGAAAAAAT AGCTAATTAT CTTGTTAAGT ATCTGAGTTC AGACTAA
 
Protein sequence
MKTKNILFLS NGHGEDAHNC QIIKAFTKIS PDTNISALPI VGVGNSYENL NIPIIGPRVN 
MPSGGFLYLS PLLLFEDLGK GLISLTWQKL QTIWTFAKNC DLIMATGDIV VAAMAYSTRL
PYMIFLSADS SYYEGRINLG LILPKLLHNS RCLKVFARDA LTAKDLKRQG VTKTEFVGTP
VMDNLISTGK NLRLKTELFT IAILPGSRLP EAGKNLCLLL KLVREIVKVM GVNVCQFRAA
IVPILMFELE AIAISEGWEC QGSKLTFFTQ EYTIEVICYE DAFADILQHS SLVIGMAGTA
IEQAVGLGKP VITIPGEGPS FTYRFAEAQT RLLGSSVQVI GKRMANSFIL QEAARKVKEI
LADEEYLQSC INNGLERMGK PGASEKIANY LVKYLSSD