Gene Tery_4954 details


Gene Information

Locus tag: Tery_4954
Symbol:
ID: 4246608
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7547130
End bp: 7548305
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 38%
IMG OID: 638109765
Product: glycosyl transferase family protein
Protein accession: YP_724341
Protein GI: 113478280
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID: [TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI
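
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched in a few lines. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 7547130
END_BP = 7548305
GENE_LENGTH_BP = 1176
PROTEIN_LENGTH_AA = 391


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length:", span_length(START_BP, END_BP))
    print("expected protein length:", expected_protein_length(GENE_LENGTH_BP))
```

Under those assumptions, the span from Start bp to End bp equals the listed gene length, and the listed protein length equals the codon count minus the stop codon.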


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00767556
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAACCC TTGTAGAAAT ACTTCAAATA ATTTGCCTCA TACCCATCAT TAGTGGGTGT 
ATATATTTAA TCTTAAGTGT CTGGACAATT AGAAATTTTT TCAGAAAAAC CACTACAGAA
ATAAATGCCA CAGAAAAATT TCAGCCTCCT GTAACTGTAC TTAAACCAAT ACGGGGGATC
GAAAAAAACC TGAAGTCAAA TCTGCATACT ATTACTATTC AAGATTGGCC AGAGTATCAA
GTAATTTATT CTATTCAAGA CCCTCAAGAT TCAGCTCTCC CTATTCTTGA CGAGCTTCAA
GCAGAAGTAG ACAACCAGAA AATTTCCGTT GTTATTGACA ATAAACAAGC AGGAGCTAAT
GGCAAGGTTA ATAACTTACT TGGTGCAATA GCACAAGCAC GTCATCAGAT TATTATTATT
AGCGATAGTG ATACTAATCT TAAACCTGAC TATATCAAAA ATATCATATC TCCTTTATCA
AATCCTAATG TGGGAGCTGT CTGCACTCTC TTTAAAGTCA AAAGTGCTTA TAGATGGTTT
GAGAAGATGG AATTGTTAAC AATAAATGCT GACTTTATTC CTAGTGTTAT ATTCGCAGCA
GTCACGGGAG CATCCAATGC TTGTTTGGGA CCCTCGATCG CTATAAGTCG CAGCACATTA
CAAGAACTGG GTGGCCTTGA GAGTCTGGCA GATTATCTTG TAGAAGATTA TGAATTAGGA
CGGAGGGTAT GGACTTCTGG AAAAAAAATG GTGCTTTTGC CATATACTAT TGATGTGACT
GTAGACTTAA AGAATTGGCA AGAGTGGTGG ACTCATCAAG TCTATTGGGA TCAGAATACA
TATTTGGCAC GTCCCTGGCC TTTTATTGCA ACTATATTAA TCCGGGCAGT ACCTTTTGCT
ATTTTGTTCG CTATAGTGAG AATGGGGGAT TTACTCGGAT TAGGAGTATT AGGGTTTACT
TTGGCTTTAC GACTTTTCAG CGCTGGGATA ACTTTAAAAG AGTTGAAAGA TGTAGAAGGT
TTTCAAAGTC TTTACTTATT ACCTTTACGT GACACTTTCG GTTTAATATT TTGGTTTTTG
GCGTTGACTA AGCGTACAGT GGTATGGAGG GGTGTTAAAT ACAAATTGGT CGATCATGGA
AAAATGGTTC CCGTCAGCAA AGGAGTAGGG AATTAG
 
Protein sequence
MPTLVEILQI ICLIPIISGC IYLILSVWTI RNFFRKTTTE INATEKFQPP VTVLKPIRGI 
EKNLKSNLHT ITIQDWPEYQ VIYSIQDPQD SALPILDELQ AEVDNQKISV VIDNKQAGAN
GKVNNLLGAI AQARHQIIII SDSDTNLKPD YIKNIISPLS NPNVGAVCTL FKVKSAYRWF
EKMELLTINA DFIPSVIFAA VTGASNACLG PSIAISRSTL QELGGLESLA DYLVEDYELG
RRVWTSGKKM VLLPYTIDVT VDLKNWQEWW THQVYWDQNT YLARPWPFIA TILIRAVPFA
ILFAIVRMGD LLGLGVLGFT LALRLFSAGI TLKELKDVEG FQSLYLLPLR DTFGLIFWFL
ALTKRTVVWR GVKYKLVDHG KMVPVSKGVG N
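
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the CDS check is deliberately narrow: it only accepts ATG as a start codon, although translation table 11 also permits alternative start codons.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


def looks_like_cds(dna: str) -> bool:
    """Simplified check: whole codons, ATG start, standard stop codon at the end."""
    return (
        len(dna) % 3 == 0
        and dna.startswith("ATG")
        and dna[-3:] in {"TAA", "TAG", "TGA"}
    )


if __name__ == "__main__":
    # First line of the gene sequence from this record, used as sample input.
    first_line = "ATGCCAACCC TTGTAGAAAT ACTTCAAATA ATTTGCCTCA TACCCATCAT TAGTGGGTGT"
    dna = join_blocks(first_line)
    print(len(dna), round(gc_percent(dna), 1))
```

To check the whole record, paste the full gene sequence block into `join_blocks` and compare the resulting length and GC percentage with the Gene Length and GC content fields.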