Gene Tery_1372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1372 
Symbol 
ID4245472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2104828 
End bp2106021 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content31% 
IMG OID638106545 
Productglycosyl transferase family protein 
Protein accessionYP_721156 
Protein GI113475095 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03469] hopene-associated glycosyltransferase HpnB 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.704698 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCAAA AATTTCTCTT AATAACAGTA ATTTCTAACT TCATAATATG GATATACTTA 
TTAATATTTC GCGGTAATTT TTGGCTGGCA AACCAACAAC TATTATCAAA ATCAAAAACA
AAAATTGAGA ATACGGAAAA TTTGCCATCA ATTTATGTGA TAATTCCTGC TAGAAATGAA
GAAAAATTAC TAAAAATTAC CCTAAATTCT TTATTAAATC AAGATTATTC AGGGATATTA
AAAATAATAT TAGTAGATGA TCACAGTAAA GATAATACAA TCAATATAGC GAATTCTTTG
GCTCAACAAG GTCATAATTC TACAAAGCTA GAAGTTATTT CAGCAGCAGA TTTACCTAGT
AATTGGACAG GAAAACTATG GGCAATTAAT GAAGGAATTA ACTATGCAAA AAAACAAACT
CCAGCCCCAG ATTATTTTCT ATTAACAGAT GCAGATATTG AACATTTCCC TACTAATATT
CGCCAACTTG TTGTCAAAGC AGAACAAGAA AATTTAGCCT TAGTTTCTTT AATGGTAAAA
CTACAATGCG AAACAATAGC CGAGAAATTA ATGATTCCCG CATTTGTATT TTTCTTTCAA
AAGTTATATC CATTTAAATG GGTAAATAAT CCCCAAAATA CTACTGCAGC TGCTGCTGGA
GGTTGCATAT TAGTTCGTCA TAAAAATTTA GATCAAGTTG GAGGAATAGA GGTTATTAAA
AATGCTTTAA TAGATGATTG TAATTTAGCT AAAATAGTTA AACAAAAATC CACAAATAAA
AATATCTGGT TAGGGCTAAC TAATGATACG AAAAGCCGAC GTTCTTATCC TGATTTAATG
AGTATTTGGA ATATGGTAGC TCGTACTGCT TTTACTCAGT TAAATTATTC TCCATTCTTG
TTATTAGTAA CAGTAATAGG AATGAAATTA GTTTATTTAA TTCCCTCATT AGGAATAATT
TTGGGAGTTA TTTTTGGTTG GTGGCCAGTA GTAGTGATCG CGATCTTAGC AAGATTATTA
ATATTTTTAG CTTACTTACC TATTATTAGA TTTTATGGAC TTTCACCAAT ATATGCCATG
AGCTTACCCA CTGTTGCTTT GATTTATATA TTAATCACAA TAGATTCAGC TTGGCGACAC
TGGCGAGGGC GAGGCGGTTA TTGGAAAGGA CGAGTTAATA CCAGTATATT CTGA
 
Protein sequence
MFQKFLLITV ISNFIIWIYL LIFRGNFWLA NQQLLSKSKT KIENTENLPS IYVIIPARNE 
EKLLKITLNS LLNQDYSGIL KIILVDDHSK DNTINIANSL AQQGHNSTKL EVISAADLPS
NWTGKLWAIN EGINYAKKQT PAPDYFLLTD ADIEHFPTNI RQLVVKAEQE NLALVSLMVK
LQCETIAEKL MIPAFVFFFQ KLYPFKWVNN PQNTTAAAAG GCILVRHKNL DQVGGIEVIK
NALIDDCNLA KIVKQKSTNK NIWLGLTNDT KSRRSYPDLM SIWNMVARTA FTQLNYSPFL
LLVTVIGMKL VYLIPSLGII LGVIFGWWPV VVIAILARLL IFLAYLPIIR FYGLSPIYAM
SLPTVALIYI LITIDSAWRH WRGRGGYWKG RVNTSIF