Gene Tery_4018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4018 
Symbol 
ID4244585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6212390 
End bp6214834 
Gene Length2445 bp 
Protein Length814 aa 
Translation table11 
GC content34% 
IMG OID638108928 
Producthypothetical protein 
Protein accessionYP_723509 
Protein GI113477448 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.505763 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACAA TTCCAACTCA ATGTCAAGAC TTAGAAAATC AGGTAGGTAT ACTACTTCAA 
CTATTATATC AAGAGCCTAC TCTTAGACAA CAAGATACAA CAGCAGTTCG TTCTTCCTTA
AATAAAGTAA TTTCCCCAAA GTTTGAAATA GTATTTGCCG GAGCTTTTAG TGCAGGAAAG
TCGATGTTAA TTAATGCTTT ATTAGGGAGA GAATTACTTT ATAGTGCTGA AGGTCATGCA
ACAGGAACAG AATGCTATAT CGAATATGCT GAACCAGAAA AAGAAAGAGT AATTTTGACA
TTTTTAAGCA GAGCTGAAAT TCAAGAACAG GTAAGAATTT TACGACAGCA ATTAGGTTTA
GAATCTAATC CCAATATTGA ACAATCAGAA GTAATTGGAA TTATCCGGGA ACAATGCAAA
GCAATTATTA AAAATGAAGG AGGAGAAAAT AAATCGGAAC GTGCCAAGCA AGCAAAAGCA
TTAAAATTAT TAGTGGATGG TGTTGAGACA AATAAAGAAT ATATTAATCC TATTGAAAAT
GCTATTTATT CAATGGAACA GTTCGACTTT TCTAACCTAA AAGAAGCAGC AAGTTATGCC
AGAAGAGGAA TGAATAGTTC TGTGTTGAAG CGCGTAGAAT ATTATTGTCA TCATCCTTTA
TTAGAAGATG GCAATATTAT TATTGATACC CCAGGAATTG ATGCTCCTGT AGAGAAAGAT
GCTCAAGTTA CTTATGACAA AATAGAAAAC CCAGAAACTT CTGCAGTTGT CTGTATTTTA
AAAGCTGCTG CTGCAGGAGA TATGACAATA GAAGAAACGG AACTGTTAGA GAAAATGCGA
GAAAATCCTG GTATTAGAGA TCGGGTTTTT TATGTCTTTA ACCGTATAGA TGAAACTTGG
TACAATTCTC AACTTAGACA ACGATTAGAC AATTTGATTA TTTCTGATTT TCGAGATACA
AAAAGGATTT ATAAAACTAG TGGCTTATTA GGTTTTTATG GTAGTTTAAT TAAGGATACA
AGCGGGCGGG AGCGCTTTGG TTTAGATAGT ATATTTGCTA ATAGTATTAA AGGTTTAGAT
GGAGAGGAAG AAACACCTCA ATTTGTTTAT GCTTTTAATA ATTATTGTGG CACTTCTCGC
AAGTTGCCGA GTAATTTTCG GGTCAGTATT AATGGTTTTG AGTCACCTAA TGAAAATTAT
GTCAGGATTT TAGCGGACTG GGGTAAACCA TTAATCAAAC AATTAATAAT TGATAGTGGT
ATAGAAGAAT TTAGGGATGG AATTACTCGA TATTTAAAGG AAGAAAAACG CCCACAATTA
TTTGCAAATT TGGCTGATGA TTTACAAAAT ATTTGTATAC AGTTACAGAA AAATTATTTG
GCAACTCAGC GAGACTTAGC AAGTCAACCT CAAGAAATAG AGGCAATGAA AGCTCAGGAA
TTATCATTAT TAAATTTTCA ACTTCAGGAG GTGGGCGAAG GATTTAATCA ACATATTACT
GAGGAAGTAA ATTTACTAGT TACAAATAAA TATGATGCTT TTGAAGCAGA TTTTAATCAA
CTTCAATCTC GAATGATTCG TCGTTTAGAT GAACTGTTAG ATACTTTTTC TGTAGAGGCT
GCTTATAGTC GGGCAACTGT TAATCATCCA CGGAATGCAA CGGCTCCTTT ATTGGCAGTT
TTAGTTGAAG CACTTTATTA TTTAGCAAAT CAATTAGAGG ATATTTTAGT TGACTCTACT
CAGTCAGTAA TAACTGGATT TTTTCAAGGT TTAGTTGACC GAATTCGCAA GTCAGAATAT
TATCGGCAAC TCTATCGTTT GTTAGGTAAT GATGGAGGAA TTGAAAAATG TTTAAAAGAG
GTAGAAAAAA AGGTTAGTAC GGCATTAATA AGTGCAGCGC GAATAGAATG TGATAGATTT
GTGCGAGAAA GTCCGCGTTT TTATGATGAA GGAACTTTTT CTATCTATCA GTTTCGGCAG
ACTTTGTTAC AGACTTCTCA AAGTTTTGAC TGTAGCAGTA TGGTGGAGGC GGAACCTGCA
ATTAGACAGT TGTTGAAGTT AGATTTTGAG CCAAAAGTTT CTCAGACTAT TCGGCAAACT
TTTCGACAAA CTGTGAATCA AACTTTGAAG ACTAATTTAT TACCAATGGC TGAAAAAAAA
GGGGAAGATA TTATGCAGCA ATATAATCAA GCTCGGAAAT ATTTAGAGCA AAGTTTAGAG
GCGGAAGCTG GAGAGAAAAT TCAAAGAAAT TTACGTTTAC AAGCTGAGGT AGATGAGAAG
GTTAAGGTTT ATAATCAAGC AGTTTCTTCT ATTAATAGTT GTCTACAGGT GATGTTGTTG
AATGAGCGAC AGTTGCCAAT CATTTCTGAT TTTAATTTTG GCAAAGGTGA GGAAAATTTG
GAGGTACTAA GTAATGAGGA TGAGAAGTTA TTGGAAGGTG AGTAA
 
Protein sequence
MNTIPTQCQD LENQVGILLQ LLYQEPTLRQ QDTTAVRSSL NKVISPKFEI VFAGAFSAGK 
SMLINALLGR ELLYSAEGHA TGTECYIEYA EPEKERVILT FLSRAEIQEQ VRILRQQLGL
ESNPNIEQSE VIGIIREQCK AIIKNEGGEN KSERAKQAKA LKLLVDGVET NKEYINPIEN
AIYSMEQFDF SNLKEAASYA RRGMNSSVLK RVEYYCHHPL LEDGNIIIDT PGIDAPVEKD
AQVTYDKIEN PETSAVVCIL KAAAAGDMTI EETELLEKMR ENPGIRDRVF YVFNRIDETW
YNSQLRQRLD NLIISDFRDT KRIYKTSGLL GFYGSLIKDT SGRERFGLDS IFANSIKGLD
GEEETPQFVY AFNNYCGTSR KLPSNFRVSI NGFESPNENY VRILADWGKP LIKQLIIDSG
IEEFRDGITR YLKEEKRPQL FANLADDLQN ICIQLQKNYL ATQRDLASQP QEIEAMKAQE
LSLLNFQLQE VGEGFNQHIT EEVNLLVTNK YDAFEADFNQ LQSRMIRRLD ELLDTFSVEA
AYSRATVNHP RNATAPLLAV LVEALYYLAN QLEDILVDST QSVITGFFQG LVDRIRKSEY
YRQLYRLLGN DGGIEKCLKE VEKKVSTALI SAARIECDRF VRESPRFYDE GTFSIYQFRQ
TLLQTSQSFD CSSMVEAEPA IRQLLKLDFE PKVSQTIRQT FRQTVNQTLK TNLLPMAEKK
GEDIMQQYNQ ARKYLEQSLE AEAGEKIQRN LRLQAEVDEK VKVYNQAVSS INSCLQVMLL
NERQLPIISD FNFGKGEENL EVLSNEDEKL LEGE