Gene Tery_4771 details


Gene Information

Locus tag: Tery_4771
Symbol:
ID: 4246425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7329873
End bp: 7332980
Gene Length: 3108 bp
Protein Length: 1035 aa
Translation table: 11
GC content: 33%
IMG OID: 638109621
Product: glycosyl transferase family protein
Protein accession: YP_724197
Protein GI: 113478136
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
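
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with one stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 7329873
end_bp = 7332980
gene_length_bp = 3108
protein_length_aa = 1035

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values (3108 bp, 1036 codons, 1035 aa).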


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.201139
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCCAA AAGTAAGCGT AATTATTCCT GTATATAACT GCGAACTCTA CATCGCTCAA 
GCAATTGAAA GTGTCTTGAA TCAAACTTAT ACTGACTATG AAATTATTGT TATAAATGAC
GGTTCTACAG ATAATACCCA TCAGGTATTA CAGCCATACA TGAAAAAAAT TCGTTATTTT
TATCAAGAGA ATAAAGGATT ATCTGCTACT CGTAACCAAG GGATAAAAAT GGCAAAAGGA
GAATTAATTG CTCTCCTGGA TGCTGATGAT TTATTTCTCT GTTATAAACT TCAAGAACAA
GTAGCTATTT TTGATGCTCA ACCAAACATA GGTTTAGTGC AAAGTGGGTG GCGAGTTGTG
AATGAAAAAG GAGAAAAAAT TGAAGATATT GAACCTTGGT ATAAGTCTCC AAAATTAGAC
TTAGTAAGTT GGTTAAAATG GAAAGCTACT AATCCTAGTG CTATGATGTT TCGGAAAGAA
TGGTTGGAAA AAGTAAATGG GTTTAATGAA AATTTACGCA GACTAGAAGA TTTTGATATT
GTAATTCGAT TAGCTTTAGC AAGTTGTCAA GCTACTTGGT TTCCCAAAGT TGCTGTCTGT
TATCGTCAAC ATAGCGGTAA TATGACTCGG AATTTACTTG CTCAAACAGA AGTAGAAGAA
AAAATTTTAG ATGAACTTTT TTCTAATCCT AATCTCCCTG AAAAAATTCA ACATTTAGAA
AGAGAATTAC GCTACGGTTC TCTAATTTGG AATTGTTGGT GTTTATATAA ATCTGGCTAC
TTCGCTGAAA TGGCTGATTA TTTGCATAAA TCTTTAAAAT TTTCTCCTGT TTCTGTACTT
AAAACTATAT GTGAGTGGCT ATATTATTTT CACTATTATG CTTCCTTAGA AAACTATTCA
TTTAATTCAG TTTCTTTGAA TAAAATACCT GAGTGGCAAC TTTTAATAAA TCAAATCATG
CCAAATGGTT TACCAGTAGT TAGCGTAGTT ATTCCGACTT TCAATAATGC TGATTATATT
CTTAAAGCTA TCGAAAGTGT TTGTAATCAA ACCTATACTT TGTGGGAAAT TATTATTATT
GATGATGGTT CTACCGATAA TACTTATCAA GTTTTAGAAT CTTATCTAAA TACAACAGAC
TTCTTGCAGA ACTCCGCCGC AAGAAATAGA CAACAGAAAA CAAAAATAAA TACTCAGGAA
AAAGAGAATA ATCAAGATTT AATTTTAGAT TTATTCCAGA ATTCTGCCAA AGGAAATAGA
CAACAAAAAA TCGAAAAAAA TACTCAGGAA AAAGGCAGTA GTCAAGATTT AACTTTGACT
TTTGCAAGAT GTCTAATTAA ATATCTATAT CAAGAAAATC AAGGACCATC AATTGCTCGG
AATTATGGTA TAGAAATAGC TCAGGGAGAA TATATTGCTT TTCTCGATGC TGATGACTTT
TTTCTACCCG AAAAATTAGC TGAACAAGTG GGATGTTTTG CAGAAGATTC AACTTTAACA
ATGGTACAAA CTGGGTGGAG ATTTGTTGAT GAGAAAGGTA ATACTATTAA AGATGTAGAA
CCCTGGGAGA ATAGTACGGA ATTAAACTTA GAAAATTGGT TGATTTGGCA AGCAACTTTG
CCTAGTGCCA TGATGTTTAA AAGTAAATGG TTAAAAGTTA ATGGTGGGTT TGACAAACGA
TATTTTGGGA TTGAGGATTT AGAAATAGTT TTGCGGTTAG CTTTAACCGG AGGAAAAGCT
ACTTGGTTAC GGAAAGTTTG TGTCTGTTAT CGACAGCGGA ATAGTAGTGT CTCGGGTTTA
AAAAACCGAG AAAAAATTAC TCAGGAATTA GAAAAATTAT TAGATGACTT TTTCCATAAA
CCCAATTTAC CTACTGCCAT ACGTCAAATA GAAAATCAGG TAAAATATCG CTCTCTACTT
TGGAATTGTT GGCGTTTATA TAAAGCTGGT AATTTATCAG AAATGACTAC TTATTTGCAG
AAGTCTTTGA GTTATACTTC TGATTCTCCC ACAGTAGCAA TTTGTACTTG GATAGAATTA
TTTACGAAGC TAGATAGTCA AGAAGGAAAT CAATTTAATG CTTATAGTTT TAGTCAGTTA
CCAGAATGGA AAAAATTAAT TAGTAATATT TTTAATCCCG TTGTGCCAAG AGTTAGTGTA
ATTATTGCAA CCTACAATAA TGCTCACTAT ATTTTGGAGG CGATCGCCAG TATTTTTAAT
CAAACTTATA CCTCTTACGA AATAATAGTT ATTGATGATG GTTCTACTGA TAATACTCGT
CAAGTTTTAG AACCTTATCT AGATAAAATT TGCTATGTTT ATCAAGAAAA TAAAGGAGTT
TCCCACGCCA GAAATTTAGG CTTAGAAATA GCCCAAGGAG AATTTATTTC TTTTCTAGAT
GCTGATGACT TTTTCTTACC AGATAAATTA GCAAAACAAG TAGCTGTTTT TGATGCTCAC
CCCTCTCTGG GAATTGTCCA TAGTGGATGG CGTTTAGTTA ATAAAAAAGG AGAGAAAATT
TCTGATATAG AGTTATGGCA TAGTTCCCCA GAATTAGACT TAGAAACCTG GGTAGTTTGG
AAACCTGTAA CTATTTCTAT GATGTTTAGT AAAAGTTGGA TAAAAAGTGT CGGAGGTTTT
GATACTCGTT GGCATCATGG AGAAGATATT GATTTAGTTT TGCGTTTATC TGTGGATGGT
TGTGAAGCAA TGTGGTTGCC AAAAGTTACT TATTGTTATC GTCAACATCA TTGTAATGCA
ACGAGAAAAT CTACTCAACA AGCTGCTAGC ATGATGTCTG TTTTAGATAG CTTTTTTAGT
AGACTAAATT TACCAATATC TATTCGTAAA TTAGAGAGCC AATCTTATTA TTATACTTTG
AGTTGGCTAG GATGGCAAGC CCATCGAAAT GGGAGTTTAA TTGAGATGGA TAAATATTTA
CGACAAGCTT GGAAATATAC TCCTTTCTCT ATGACAGAAA CTATCGATTA TTGGGTCAAT
AGTTTTAATA ATCTTTATCA ACAATATGGT TATGATTTTA ATGCTTATGC TTTGACTAAT
TCAGATGTTT GGCAACAATT AATGTTAGAC ATGGTTATTA CTCTTTGA
 
Protein sequence
MNPKVSVIIP VYNCELYIAQ AIESVLNQTY TDYEIIVIND GSTDNTHQVL QPYMKKIRYF 
YQENKGLSAT RNQGIKMAKG ELIALLDADD LFLCYKLQEQ VAIFDAQPNI GLVQSGWRVV
NEKGEKIEDI EPWYKSPKLD LVSWLKWKAT NPSAMMFRKE WLEKVNGFNE NLRRLEDFDI
VIRLALASCQ ATWFPKVAVC YRQHSGNMTR NLLAQTEVEE KILDELFSNP NLPEKIQHLE
RELRYGSLIW NCWCLYKSGY FAEMADYLHK SLKFSPVSVL KTICEWLYYF HYYASLENYS
FNSVSLNKIP EWQLLINQIM PNGLPVVSVV IPTFNNADYI LKAIESVCNQ TYTLWEIIII
DDGSTDNTYQ VLESYLNTTD FLQNSAARNR QQKTKINTQE KENNQDLILD LFQNSAKGNR
QQKIEKNTQE KGSSQDLTLT FARCLIKYLY QENQGPSIAR NYGIEIAQGE YIAFLDADDF
FLPEKLAEQV GCFAEDSTLT MVQTGWRFVD EKGNTIKDVE PWENSTELNL ENWLIWQATL
PSAMMFKSKW LKVNGGFDKR YFGIEDLEIV LRLALTGGKA TWLRKVCVCY RQRNSSVSGL
KNREKITQEL EKLLDDFFHK PNLPTAIRQI ENQVKYRSLL WNCWRLYKAG NLSEMTTYLQ
KSLSYTSDSP TVAICTWIEL FTKLDSQEGN QFNAYSFSQL PEWKKLISNI FNPVVPRVSV
IIATYNNAHY ILEAIASIFN QTYTSYEIIV IDDGSTDNTR QVLEPYLDKI CYVYQENKGV
SHARNLGLEI AQGEFISFLD ADDFFLPDKL AKQVAVFDAH PSLGIVHSGW RLVNKKGEKI
SDIELWHSSP ELDLETWVVW KPVTISMMFS KSWIKSVGGF DTRWHHGEDI DLVLRLSVDG
CEAMWLPKVT YCYRQHHCNA TRKSTQQAAS MMSVLDSFFS RLNLPISIRK LESQSYYYTL
SWLGWQAHRN GSLIEMDKYL RQAWKYTPFS MTETIDYWVN SFNNLYQQYG YDFNAYALTN
SDVWQQLMLD MVITL
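
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing might be cleaned, translated, and summarized follows. The function names (`clean_sequence`, `gc_content`, `translate`) are hypothetical helpers written for this illustration, and the codon table is built by hand rather than taken from any library. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as starts), so the standard assignments are used here.

```python
# Codon table in the conventional TCAG ordering (same assignments as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        # Codons with unexpected characters are reported as "X".
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGAATCCAA AAGTAAGCGT AATTATTCCT GTATATAACT GCGAACTCTA CATCGCTCAA"
print(translate(first_line))
print(round(gc_content(first_line), 1))
```

Translating the first 60 bases this way gives `MNPKVSVIIPVYNCELYIAQ`, which matches the first two blocks of the protein sequence. Passing the full gene listing to `gc_content` would allow a comparison with the GC content field in the record.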