Gene Moth_0215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0215 
Symbol 
ID3831366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp211573 
End bp212673 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content62% 
IMG OID637828151 
Productpolysaccharide pyruvyl transferase 
Protein accessionYP_429093 
Protein GI83589084 
COG category[S] Function unknown 
COG ID[COG2327] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03609] polysaccharide pyruvyl transferase CsaB 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCAGGG TAGTAATCTC CGGTTATTAC GGTTTTCAGA ACGCCGGGGA CGAGGCGGTC 
CTCTACAGCA TCGTTAAAGC CTTGCGCTCC CTGGAACCGG ACATAGAGAT CACTGTCCTT
TCCCGGCGTC CGGAGCAAAC GGCGGCCTGT TTAAAGGTCC GGGCAGTAGA CCGCTGGCAC
CCGGTCCGGG TGGCCGGGGC TATCCGCCGG GCCGACCTGG TTATCAGCGG TGGGGGTAGT
TTGTTCCAGG ATGTTACCGG GCCTAAAAGC CTGCTCTATT ACCTGGGTAT AGTGTTGCTG
GCCCGGTTAT TGCGGAAGCC GGTAATCGTC TATGCCCAGG GACTGGGCCC TTTGAAACGC
CACTGGAGCC GTTGGCTGAC GGGCCGGGTA TTAAACCGCG TCCAGCTTAT CTCCCTGCGG
GACAGCGAGT CACGGCGGCT GCTGGAAGAG CTGGGGGTTA CCCGGCCGCC GGTCTATGTA
ACGGCCGACC CGGTCCTGGG CCTGGAGCCT GAAAATATGG ACCTGCGGCC CGGCCAGGAT
AAATGGGAGC AACTGGAGCT TTCCGGGCCG GTGATCGGGA TTTCAGTGCG CTCCTGGCCG
GGGTATGAGG AGTGCTGGCC GTCCCTGGCC AGGGTAGCTG ACGAGCTCGT GGCCGGGGGG
TGGCAGGTTT TATTTTTGCC TTTTCACTTT CCCGCTGATG TCGACGCCTG CCGCCAGGTA
GCCCGCCTGA TGCACAGTCC GGCAGTTGTT CTACGGGAAA ACCTGGACCT GCCGGCCCTG
ATGGGCCTGA TGGGCCGGTT GCAGTTTTTG ATCGGCATGC GTCTCCACGC CCTGATCCTG
GCTTCTTTGA TGGGGGTTCC TTTCCTGGCC CTGCCCTATG ACCCCAAAGT GACGGCCCTG
GCCAGGATGA TGGAGCAGCC GGTCGCCGGC TTTCTGGCGA GTGTCAGTTA TACCGGCCTG
GAAGCAGCCG TCAAACAGGC CCTGGCCGAA CGTGAGGAAA ACGCCCGGCG GGTACAGGCC
GCGGTGGCCG AACTGCGGCC TCTGGCCCTG GACACCGCCC GGCTGGTAAT AGAGTATTTG
CGGAAAGGAG CAAGGGGCTG A
 
Protein sequence
MARVVISGYY GFQNAGDEAV LYSIVKALRS LEPDIEITVL SRRPEQTAAC LKVRAVDRWH 
PVRVAGAIRR ADLVISGGGS LFQDVTGPKS LLYYLGIVLL ARLLRKPVIV YAQGLGPLKR
HWSRWLTGRV LNRVQLISLR DSESRRLLEE LGVTRPPVYV TADPVLGLEP ENMDLRPGQD
KWEQLELSGP VIGISVRSWP GYEECWPSLA RVADELVAGG WQVLFLPFHF PADVDACRQV
ARLMHSPAVV LRENLDLPAL MGLMGRLQFL IGMRLHALIL ASLMGVPFLA LPYDPKVTAL
ARMMEQPVAG FLASVSYTGL EAAVKQALAE REENARRVQA AVAELRPLAL DTARLVIEYL
RKGARG