Gene Tery_1707 details

Gene Information

Locus tag: Tery_1707
Symbol:
ID: 4244092
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 2596502
End bp: 2599774
Gene Length: 3273 bp
Protein Length: 1090 aa
Translation table: 11
GC content: 33%
IMG OID: 638106836
Product: hypothetical protein
Protein accession: YP_721445
Protein GI: 113475384
COG category:
COG ID:
TIGRFAM ID:
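
The coordinate and length fields above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts one untranslated stop codon. Neither convention is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2596502
end_bp = 2599774
reported_gene_length = 3273      # bp
reported_protein_length = 1090   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```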


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCAAT CAACTAGTCA AGAACGCCAG TCTAGAGGAA TTAATCGTAC TATTTGTATC 
GGAATAGGAG GTACTGGTAG AGATATTTTG ATGCGAATTC GTAGATTAAT TGTAGATAGT
TATGGGGATT TAGAAAAACT GCCAATTGTG AGTTTTGTTC ATATTGATAC TGACAAGTCA
GCTAGTCAAA TATCTGGTTT AAGAACTGGC AGTACATATC ATGGAGTTGA CCTTAGTTTT
AAGGAAGCTG AGAAAGTTAG TGCCACAATG AGTAGCAGAG ATGTGGATAA TTTTACTCAA
GAATTAGAAA AACATTCCAC TTCTGAAAGA ACAGGACCTT ACGACCATAT TCGTCAATGG
TTTCCTCCTC AACTTAATCG GAATATTAAG GCAATAGAAG AGGGTGCAAA AGGAATTAGA
CCAGTAGGAA GATTAGCTTT TTTTCATAAC TATCGTAAGA TTCAAAAAGC AATTGAAATA
GCGGAAAAAA AAACTAGAGG AAATGAGGGT ATTTTACTAA AAATGGGGTT AAGGGTAGAT
GGGGGAGTAA ATATTTTTAT TGTGGGTTCT CTTTGTGGAG GTACTGGCAG TGGAATGTTT
TTAGATGTTG CTTATAGTTT GAGACATTCT TATGGTCAGC AAAACACTAA AATTGTTGGT
TATTTAGTAA TTAGTCCTGA GTTATATGGA AATAATACTA ATATGAAAGC TAATACTTAT
GCAGCCTTAA AAGAACTAAA TCATTACACT ACTCCTGGTA CAAAATTTGA AGCTTGTTAC
GATATACAAG ACTTAGTAAT TGTTCAATCA GAACGACCTC CTTATGATTA TGCTTATTTG
GTTTCTAATG AGACAACAGG AGGATATAAA ATTCTTGAGC AACGTAAATT ATGTAATGTT
ATTGCTCATA AAATTGCTTT GGATTTTTCT GGAGAATTAG CGCCTATAGT TAAAGGTATG
AGAGATAATT TTTTACAAGA TTTAATCCAA TATGATAATC ACCCTCGTCC TAATATTCAA
CGCTACTTAA CTTTTGGTAT GGCGGCTATT TATTTTCCAC GGGATAAAAT TGTTCAAATA
GCTTTAAATC GTATTAGTTT AAATTTAGTT AATTTCTGGT TAAATGGTGA AGGTCAAAGC
CCAGATTCGG TTTTATTACT GGAAAGATTT TTACTTCAAT ATCGCTGGCA TGATGACTTG
CAAAAACGAG ATGGATTAAG TAGAAGATTA GCCGAATCTG TGGAAGAGTC TAATAAGAAT
TTTCAAAGTA CAATTAATGC TTGGAGAAAT AAATTAGATA GGGCAATTTC TGAGTCTCAA
AATAAGGATG ACCGGATAGC ATTACAGCAA CGATTACCGA GAGATTTTCG AGAGCAATTT
AGAAAGGTTA TTCCCGGAAC AACTGATAGT AGTAGAGGGG TTTGGTTGAC AAGGTTGCAA
CAAATTTGTC CGAAAATTAC AGTGGAATTT CAACAAAATA TTAATAGTTT TATTGTGGAT
TTGCTAAGTC CGAGTAATGA TAATTTTTCT ATTAGAAGTA GTCGGGAATG GCTAGAAGCA
ATGAAAAATG AACTGAATAA TTATCAAAGG GATATTGAAG AAAAAATTGT AGATTTTAAT
GGTAGTAAGA TATTAGAGGA TGTAGACAGA AAATGGCAAG ATGGCGAACA GATAATTGCA
GATATTGAGA ATAAATTTGA ACTGCCTTTA TTGGGAGGTA AGAATAGAGA ATTTCAAGCT
GAAGCTAAAA GAATAGTCAG ACAAATTTGC GATTTAATTA GACATAATTT TGACTTAATA
GTTTTCCAAG AAACTCTTGA AATAGTAAGA AATTTACAAA ATCATGTTGA AAATATTGCT
GCCCAACTAA GAGAATTTAC AAACTTACTT AATAACCTTA AAAATGACTA TGAAAAAGAA
CAAGCAAATT TAAAACAACT AGACTTTGAT GAAATGAGTG GAGAAGCTAT TTTTGCGGAG
GAAGATATCA ATAACTGTTC TCAAATTTTA CTCCCTCTCG AAGAGCTAAA GTCACAATTG
GTACAACTAA GTTCGGAAAT TACTAAATCG TCTGAAGACG GGCAATCAAT TGCAATTTTT
ATCAGAGAAA AAGTCAGGGA AGAAGAGCAA GTAAAAGAAG AAGTTAATCT CACAGTAGAT
AGGTTATTTG GAGTTCGGAG TACTAATATT GTTAATTCTG TAATTAAGCG TTTTCTACAG
AACTATTCTA GGGTTGAAAG GAGCACTCGT TTGGGACAAA TTATGCAGGA GGCTGAACCA
CTTTTACCTT TAGATTTTAA TGCTATTTAC TTCAGAGATA GGGATTCTAA ACACAGTCAA
TTAATAGGAT TTAAAGATAC TGATGATGAA GGAGTTAAAC AATTTAAAGA TATTTTGGTG
GGAGATTTAG GAGTTGACAA TAATACTCTT AAACCAACTC AAGCGGAAGA CCAAATTTTA
ATCGTTAAAG AATATGCAGC TTTTCCTTTG AGACTAATTA CTAATCTTGA GCAAATGAGA
AATTCTTACA TTCGAGAAAT GAATTATTCT ACTTCTTTTC TCCATAATAA CAATCGGATT
TTCTTTGCTG ATATTATTCC TCCAGATGCA GCAAAAATAG AAGAGTTAGA AGATATTTTT
TATCCTTGTT TGGCTTTAGA ATTAATTCAA AAAAATGGGG ATGAGTTATT AGAATTTATG
TATTACAATG ATATGCGAGA TAGTTATTAC ACTGCGGAGC TTAGTTCAGT TTGGAATCAA
GCTTTAGAGG AATTAGCTAA TCGGCAGGAT ATGACAGAGG CTTTGAAAAA TATTCTGGAT
AAAGAAGTAG AAAAAATAGA TAAAGAACCT CTGCTCTGGG AAAACTATTA TTTACCGAAG
TTGCAAAAGT TTGTTAAACA AATTGATAAT TTACCGGAAG ATGATCTTAA TTATCCTTAT
AAAGATACGG TAGTTGGAAC ACCACCAACA ACTGAAAATT CTGGAAAGGA AGGAATTATT
AATCGCTTTC GCCGTCGTAT ACAAAAAAAA GTAAAAAATA TACAATTACT TCAGTCAAAA
TCTACTGTAA CAGAGGAAAA TAAGGAGGTA ATAAAGGATG AAATTATTGA CGTTGATATT
GATTTAGATA ATTCTCCAAA TTATCGCGAT AAGCAACATA ATTCAGGAGA TGACCGGATG
AAACAATTGA AGGATTTAGT AAGGATGAAA GAACAGGGAT TTTTGACAGA TGCAGAATTT
CAGGCAGCTA AGAAAAATAT ATTAGGTATT TAA
 
Protein sequence
MIQSTSQERQ SRGINRTICI GIGGTGRDIL MRIRRLIVDS YGDLEKLPIV SFVHIDTDKS 
ASQISGLRTG STYHGVDLSF KEAEKVSATM SSRDVDNFTQ ELEKHSTSER TGPYDHIRQW
FPPQLNRNIK AIEEGAKGIR PVGRLAFFHN YRKIQKAIEI AEKKTRGNEG ILLKMGLRVD
GGVNIFIVGS LCGGTGSGMF LDVAYSLRHS YGQQNTKIVG YLVISPELYG NNTNMKANTY
AALKELNHYT TPGTKFEACY DIQDLVIVQS ERPPYDYAYL VSNETTGGYK ILEQRKLCNV
IAHKIALDFS GELAPIVKGM RDNFLQDLIQ YDNHPRPNIQ RYLTFGMAAI YFPRDKIVQI
ALNRISLNLV NFWLNGEGQS PDSVLLLERF LLQYRWHDDL QKRDGLSRRL AESVEESNKN
FQSTINAWRN KLDRAISESQ NKDDRIALQQ RLPRDFREQF RKVIPGTTDS SRGVWLTRLQ
QICPKITVEF QQNINSFIVD LLSPSNDNFS IRSSREWLEA MKNELNNYQR DIEEKIVDFN
GSKILEDVDR KWQDGEQIIA DIENKFELPL LGGKNREFQA EAKRIVRQIC DLIRHNFDLI
VFQETLEIVR NLQNHVENIA AQLREFTNLL NNLKNDYEKE QANLKQLDFD EMSGEAIFAE
EDINNCSQIL LPLEELKSQL VQLSSEITKS SEDGQSIAIF IREKVREEEQ VKEEVNLTVD
RLFGVRSTNI VNSVIKRFLQ NYSRVERSTR LGQIMQEAEP LLPLDFNAIY FRDRDSKHSQ
LIGFKDTDDE GVKQFKDILV GDLGVDNNTL KPTQAEDQIL IVKEYAAFPL RLITNLEQMR
NSYIREMNYS TSFLHNNNRI FFADIIPPDA AKIEELEDIF YPCLALELIQ KNGDELLEFM
YYNDMRDSYY TAELSSVWNQ ALEELANRQD MTEALKNILD KEVEKIDKEP LLWENYYLPK
LQKFVKQIDN LPEDDLNYPY KDTVVGTPPT TENSGKEGII NRFRRRIQKK VKNIQLLQSK
STVTEENKEV IKDEIIDVDI DLDNSPNYRD KQHNSGDDRM KQLKDLVRMK EQGFLTDAEF
QAAKKNILGI
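
The protein sequence should be the translation of the gene sequence under translation table 11. The sketch below checks this for the first and last lines of the gene sequence only. It builds the codon table by hand, relying on table 11 using the same amino-acid assignments as the standard genetic code (it differs in permitted start codons). The `translate` helper and the variable names are hypothetical, introduced only for this illustration.

```python
from itertools import product

# Codon table in TCAG order. Table 11 shares these amino-acid
# assignments with the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this page.
first_line = "ATGATTCAAT CAACTAGTCA AGAACGCCAG TCTAGAGGAA TTAATCGTAC TATTTGTATC"
last_line = "CAGGCAGCTA AGAAAAATAT ATTAGGTATT TAA"

print(translate(first_line))  # MIQSTSQERQSRGINRTICI
print(translate(last_line))   # QAAKKNILGI
```

Each full line of the gene sequence holds 60 bases (20 codons), so every line starts in frame and can be translated on its own. The two outputs match the first 20 and last 10 residues of the protein sequence listed above.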