Gene Tery_1522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1522 
Symbol 
ID4241709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2310958 
End bp2312223 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content34% 
IMG OID638106669 
Productglycosyl transferase, group 1 
Protein accessionYP_721279 
Protein GI113475218 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.266558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0714001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATTC TTATCTATTC CTACAACTAT AACCCAGAAC TAATTGGTAT TGCACCCCTA 
ATGACAGAAC TTGCAGAAGG TTTTGCTAAA CGTGGACATC AAGTTCGTGT AGTGACTGGT
ATGCCAAATT ATCCTGAACG TAAAGTTTAT GATGGATATA AAGGGAAGTT TTTTCTGACA
GAATACAAAA ATGGTGTGAC AGTTCAACGT AGTTATATTC ATGTCAGAGG TTCTAAACCG
GGGGTTTTAG CTCGGCTATT GTTAGATGGT AGTTTTATTG TTTCTAGTTT ATGGCAAGCT
TTTAATGGCT GGAAACCTGA AATTATTTTT GCTACTACTC CTCCTATATT AATTTCTTTA
CCAGTTAGTT TTTACGGATT ATTTTCTAAA TCTTCTGTGG TCTTAAATAT TCAGGATATT
GTTTCTGAAG CTGCGGTAAG AGTTGGGCTA GTGAACAAAA ATAGTTGGAT AGTAAGTCTA
GCTCAAGCAG TGGAAAAATT AGCTTATTTT AAAGCTGATA AAATTAGTGT TATTACTGAA
GATTTTGTTA CTAAGTTGGT AGAACAAGGG GTTAGTAAAG ATCGTATTGT TTGCATTTCT
AATTGGGTAG ATATTAATTT TATTAGACCT CTCAATAAAA ATAATTATTT CCGAGCAGAG
CATAACCTTC AAGATAAGTT TGTAGTGATT TATTCTGGGA ATATTGCTTT AACTCAAGGT
TTAGAAACAG TAGTTAAAGC TGCTGCTAGT CTCAAAGAAA AATCAGAGAT TTCTTTTGTC
ATCTTAGGAG AAGAAACAGC GCGTCAACAA TTACAAGAAT GTTGTAATAA TTATCAAGCA
GATAATATAC TTTTACTGCC TTTAGTACCA CGAGAAAAAT TACCAGAAAT GTTATCTGCT
GCTGATGTTG GTTTAGTAAT ACAGAAAAAG ACTGTTACTG CTTTTAATTT ACCTTCAAAA
ATTCCTGTTA TTCTTGCTAG TGGTCGTCCC ATAATAGCCT CTGTTCCAGA TACAGGTACA
GCAATGAGGG TTGTCAAAGA AAGTGGTGGT GGAATAGTGG TTACTCCAGA AGATTTTTCA
GCTTTGGCAC AAGCCATTCT AGAGTTATAT GAAAATCCAA AAAAACTTGA GGAACTAGGT
CAGCAAGGGA GAAAATATGC TGAAGAAAAT TTTGGATCTA AAAATGCTTT AAATAGTTAT
GAAGCTTTAT TTGCTGAAAT TTTATCCTGT CAGGAAAGAG GAGAAAAGAA GAAAGGAAAA
GGTTGA
 
Protein sequence
MRILIYSYNY NPELIGIAPL MTELAEGFAK RGHQVRVVTG MPNYPERKVY DGYKGKFFLT 
EYKNGVTVQR SYIHVRGSKP GVLARLLLDG SFIVSSLWQA FNGWKPEIIF ATTPPILISL
PVSFYGLFSK SSVVLNIQDI VSEAAVRVGL VNKNSWIVSL AQAVEKLAYF KADKISVITE
DFVTKLVEQG VSKDRIVCIS NWVDINFIRP LNKNNYFRAE HNLQDKFVVI YSGNIALTQG
LETVVKAAAS LKEKSEISFV ILGEETARQQ LQECCNNYQA DNILLLPLVP REKLPEMLSA
ADVGLVIQKK TVTAFNLPSK IPVILASGRP IIASVPDTGT AMRVVKESGG GIVVTPEDFS
ALAQAILELY ENPKKLEELG QQGRKYAEEN FGSKNALNSY EALFAEILSC QERGEKKKGK
G