Gene Tery_4433 details


Gene Information

Locus tag: Tery_4433
Symbol:
ID: 4246086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6832866
End bp: 6836132
Gene Length: 3267 bp
Protein Length: 1088 aa
Translation table: 11
GC content: 31%
IMG OID: 638109316
Product: hypothetical protein
Protein accession: YP_723893
Protein GI: 113477832
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
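
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(6832866, 6836132)
print(length)                  # 3267
print(protein_length(length))  # 1088
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.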


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0682487
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA TCTTGTATAT ACATATAGGA ACTGGTAAAA CAGGAACCAG TTCAATACAA 
AATTTTTTAT CTGAGAATCA AAAAAATGGC AGGTTATCAA AGCACTTTAA TACAAGGTAT
TGTTTCTCGG GTCGTCCCAA TATTCATAAT ATATTGTTGT GTAAAAATTA CCAGAGGGGA
AATGAGAAGA CTCAAGAGTT GATAGATGCA AAGATTCGTG AATTGAGAAA GGAAATTTAT
CAAAGTTCAG AACAGTTATT TATTATTAGT TCTGAGTATT TTCCCGGAAA CACTCAAGAT
GAGATTCATG AATTAGTAAA TACTGTAAGT GATATATGTC AAGTTAAAAT TATTGTTTAT
TTAAGAAGGC AAGATGACTT TTTATGGTCT TGGTATACTC AACTTGTCAA GACAACTAAT
ATATTTCTTG ATATATACAA ATTAAAAGAT AAATTGTATA AAGGCCCGCA GCAGGTTTTC
AACTATGAAA CAAGTTTAAG AAAATGGGAG AATATCGTTG GGAAAGAAAA CATATTAGTC
AATATTTACG ATAGGAATAA TTTTAAGGAA AATTCAATAT ACTTTGATTT TATTCACAAT
TTAAATATTG ACTTAGATCT GAAAAAAATG ATAGTTCCAG AAGAGGTAAA TCTAAGCCTT
CAATGTGAAC AAATATGTCT GATACGAACA CTTATTCCTT ATGTGAGTGA ATGGAATGAC
GAACAACTAA AAATGTTGCA AAAACCACTT AATATCCCGT TTTTGTATAC CAAAAGTCTC
CTTTCCCCAA GAGAACGAAA AGAAATTTTA AATGACTTTA GACAACACAA TGAATTTGTG
GCTAGACGAT ATCTTAATCA AGAAAATTTG TTTGATGAAA ATATTGAAGA TGAAGACACA
TGGAGAGAAC CAAATATATT TAGTTCTGGA TATTTTGAAA AATATATTGA TTATATCAAA
ACCGAAAAAA AGACGTTGTT TGAATTACTG CAACCGATAT TAAAGAATGT CAACAGAATG
AAAATCAAAA CAGTGAGAAA TATGTTATTA GTTAATAACA TATTTATAGT TGATTGGACG
GAGAAGGCGG GTTGCACCAT AGTATGTAAA ATGTTTTTTG ATGTTATGGG GATACTCCAA
GAAGCCTTAG ACTATAATAG TTGGATACAT AATTATCGAC AAGATAAATT TTATGAAAAA
TTTGGTAAAG TAACGGAGGA TATGCTTTCT GGGGACAAGT TTGTGAAAAT GAAATTTGTC
AGAAATCCAT ATACCCGTGC AGTTAGTTCT TATTTTGCTG CAAATGAAAA CAATAAACAA
TATTTAAAAG GATATGAACA TCTTGATTTG TCATTTTATG ATTTCTTGCT TTTAGTTAAG
CAGAAAAAAA TAGTTAATGA TCATTGGAGA ACTCAATATC AAGAAATTGA ATCCAGATTT
AATTTTGATG AAATAATTAA AATTGAAAAT ATTGAACAAG AGGTAAAAAG ACTGAATCAA
AAATATAGTC TTAATTTACA ACTCCATACA CATTCTAGCC ACCATCATAA AAAAGATAAA
AATAAAAGTG AATTTGTTGG CAGAAAAAAG TATAGTGAGT TGGAAATTTC TGGTACTATT
CCAGATTATA GAAACTTCTA TGATGATGAA ACTAAGTTAC TTGTGAGTGA AATTTATGAA
AAAGATCTTA AAACTTATAA GTATACATTC GATCTGGAAA AAAATCAGGA AAGAATTAAA
GCAGAGAATA CGGATAAAGA TAAAAAAATG AAACACAATA TCAATTACAT TACAATTTGG
GAGAACGAAT GGAATTCGGT TTATGCTGAC GAATTTCTAA TTAAAGCTGC CAATTCTGAT
ATTAATACTG TTGCCTTTGA CATGCGGTGG CACAAGCACG AAACAAATGA AGGAAATTTT
GATTTTTCTC ATACTCAAGA CAGATGCGAT AAACTCATTG AGCATGGGTT CAAACTTATG
CCATTAATTT CTATCTTTTA TTGTCCGAAT TGGGTTCATG AAAAATATCC TGATATAGTT
GAGTTGAACG AAACCTTCGG TGCCATAGGC TCGGGTCATA AAGGTGTAAG TTCTGCATAC
GAAAAGTCAT TGCCCTTGGC ACTCAATTTT ATAGATAAAT GCATTGATGC ACTGGAATGC
TATAGACAGG ATATAACAGC AATAAGTGTT TCGTGGAATA ATGAACACGA AACAAAATTT
ACTCAAACAC ATGATTTGTT TAGACCCTAT GAAAAGTCTG CTCAAGAAAA ATTCAGATTA
TTTATTGAGA AGAAAAATAG TTCAATAAAA TATTGGAACG ACAGGTGGAA TACTTCCTTT
GATTGTTTTG CAGAAGTAAC TTTACCCACA TTACGATGTG AGAACAAAAA ACAAATTGAG
CAATTTCAAC TCCAGGGGCA ATTTATATTT GATTTTTACG CATTTAGAAG ATTTTTGCTT
GTTAACATAT ATCAAGGTTG TTGTGAACGA ATAAAATCTC ATCAGTATAA AACATGGCTG
CATTTTGGTG AAATATTTAC CTGTATTGAT GCAATATATC AAGGTGATGT TGTGTTTGAT
ATGATCACAG AAAATTGGTT GGATATCGTG GTTATAGATT CAAATTTATC TAAGATAGGG
GTTGAAACAA ATGACCCGTT CGTTTCTTAT GTAATTGTTT CTGCTTGTCA GCAATACGAT
AAACCAGTGA TATTTGAAGT TGCGGTGGAG CGCGATAAAT CAATTGAGGT GTATAAAGAA
AGTATTCTTT GGGCACAAAA AAAGAATGTA GACGGGCTAG GACATACAAA TTTTATGAAC
CGGAAAAAAT TTCATTGTCT TTTTCAGAAT TCATCTCTTG CCGCAGGTAT GGATGGTGTT
GAGTCAACAA AACATTTGCT TTTGATACAT CCTATTCGTG GATGTGGTGT ATTGAGGCAA
AGACAACATA TTGTGGGAGC AATATCAGTA GATCCTGTTC AAGATTATTT GATGGGTGAA
GTTAAGCACT GGTATGACAG AGGATTTAAT GTAGATATTA TCGGAGATCC TTTACTATTA
AATAAGATTA ATCAAGATGT TTATAATGAA ATAGTTTATT TGGAACCATT GGCGTTATTG
TCAGTTGACA AGAAGCATAT TTCAGATTTT GTGGAAAGGT TGCCGAGTGA CAAACCGTAT
TATCATAAGA AGTTGGATCG AGCATTCTTT GAACATCAAA CCGGGTTTGA TGGTGAACCT
TTGGAATTTT TATCTTTTGA AGAATAA
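
The gene sequence is listed in space-separated blocks of 10 bases, so it needs to be joined before its length or GC content can be checked against the fields in the Gene Information section. The following is a minimal Python sketch of that clean-up. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the listing is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, copied from this record.
first_line = "ATGAAAAAAA TCTTGTATAT ACATATAGGA ACTGGTAAAA CAGGAACCAG TTCAATACAA"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this 60-base fragment
```

Applying the same two functions to the full listing should give values comparable to the Gene Length and GC content fields, assuming the listing is complete.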
 
Protein sequence
MKKILYIHIG TGKTGTSSIQ NFLSENQKNG RLSKHFNTRY CFSGRPNIHN ILLCKNYQRG 
NEKTQELIDA KIRELRKEIY QSSEQLFIIS SEYFPGNTQD EIHELVNTVS DICQVKIIVY
LRRQDDFLWS WYTQLVKTTN IFLDIYKLKD KLYKGPQQVF NYETSLRKWE NIVGKENILV
NIYDRNNFKE NSIYFDFIHN LNIDLDLKKM IVPEEVNLSL QCEQICLIRT LIPYVSEWND
EQLKMLQKPL NIPFLYTKSL LSPRERKEIL NDFRQHNEFV ARRYLNQENL FDENIEDEDT
WREPNIFSSG YFEKYIDYIK TEKKTLFELL QPILKNVNRM KIKTVRNMLL VNNIFIVDWT
EKAGCTIVCK MFFDVMGILQ EALDYNSWIH NYRQDKFYEK FGKVTEDMLS GDKFVKMKFV
RNPYTRAVSS YFAANENNKQ YLKGYEHLDL SFYDFLLLVK QKKIVNDHWR TQYQEIESRF
NFDEIIKIEN IEQEVKRLNQ KYSLNLQLHT HSSHHHKKDK NKSEFVGRKK YSELEISGTI
PDYRNFYDDE TKLLVSEIYE KDLKTYKYTF DLEKNQERIK AENTDKDKKM KHNINYITIW
ENEWNSVYAD EFLIKAANSD INTVAFDMRW HKHETNEGNF DFSHTQDRCD KLIEHGFKLM
PLISIFYCPN WVHEKYPDIV ELNETFGAIG SGHKGVSSAY EKSLPLALNF IDKCIDALEC
YRQDITAISV SWNNEHETKF TQTHDLFRPY EKSAQEKFRL FIEKKNSSIK YWNDRWNTSF
DCFAEVTLPT LRCENKKQIE QFQLQGQFIF DFYAFRRFLL VNIYQGCCER IKSHQYKTWL
HFGEIFTCID AIYQGDVVFD MITENWLDIV VIDSNLSKIG VETNDPFVSY VIVSACQQYD
KPVIFEVAVE RDKSIEVYKE SILWAQKKNV DGLGHTNFMN RKKFHCLFQN SSLAAGMDGV
ESTKHLLLIH PIRGCGVLRQ RQHIVGAISV DPVQDYLMGE VKHWYDRGFN VDIIGDPLLL
NKINQDVYNE IVYLEPLALL SVDKKHISDF VERLPSDKPY YHKKLDRAFF EHQTGFDGEP
LEFLSFEE
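
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal Python sketch, assuming the gene sequence is read in the orientation listed, starting at its first base. Translation table 11 assigns the same amino acids to internal codons as the standard code and differs in its permitted start codons. This sketch translates the first codon like any other, which is sufficient here because the listing begins with ATG. The `translate` function is illustrative and not taken from any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a (possibly blocked) DNA listing up to the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listing in this record.
print(translate("ATGAAAAAAA TCTTGTATAT ACATATAGGA ACTGGTAAAA CAGGAACCAG TTCAATACAA"))
# MKKILYIHIGTGKTGTSSIQ
print(translate("TTGGAATTTT TATCTTTTGA AGAATAA"))
# LEFLSFEE
```

The two outputs match the first 20 and last 8 residues of the protein sequence listed above. The last line of the gene listing can be translated on its own only because the preceding lines hold a multiple of three bases, so it starts in frame.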