Gene Moth_2359 details

Gene Information

Locus tag: Moth_2359
Symbol:
ID: 3832539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2480208
End bp: 2481167
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 49%
IMG OID: 637830279
Product: glycosyl transferase family protein
Protein accession: YP_431185
Protein GI: 83591176
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
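
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2480208
end_bp = 2481167
reported_gene_length = 960     # bp
reported_protein_length = 319  # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 960 319
```

Under these assumptions the computed values match the reported gene and protein lengths.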


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00260932
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAGTA AAGTGCGTTA TACCATTATC ATTCCGGTCT ATAACGAAGA AGACGTTATT 
CGTGAAACCT ATCGCCGGCT AACCCTGGTC ATGCAATCCC TCGGTGAACC GTATGAATTG
CTGTTCGTCA ACGACGGTAG CGAGGATCGG ACGGCGGAAA TAATCGAAGT TTTAGCGGAA
ACGGACGATA GCGTGAGGCT ACTGAATTTC TCGCGCAATT TCGGGCATCA AATAGCGATT
ACCGCGGGCA TGGATTATGC CCGCGGGGAC GCCATCGTAA TTATCGACGC TGATTTGCAG
GACCCGCCCG AGCTAATCCC GCGAATGATT GAGAAATGGC AAGAAGGATA CGAAGTCGTC
TATGCACGGC GCGTTCAGCG GAAGGGGGAG ACGTTGTTTA AAAAATGGAC CGCTTCTTTG
TTCTATCGTA CCCTTCGCAT GATGACAGAA GTCGATATTC CCCTGGATAC CGGTGACTTC
CGTCTGATAG ACCGGAAAGT GTGTGATGTC ATGCATAGCA TCCGGGAGAA AAGCCGCTTT
ATTCGCGGCC TGATCAGTTG GATAGGCTTT CGCCAGGCAG CCATTGAGTA CATCCGGGAG
GAACGCTTTG CCGGAAAAAC AAAATACCCG CTGAAAAAAA TGCTGCGCTT AGCAATAGAC
GGGATCACCT CTTTCTCCCA TAAACCTTTG AAATTGGCCA CATACCTCGG TTTGGCCCTC
TCTTTGCCAA GTTTTGCCTA TCTGGTTTTC TCTCTGGGGT TAAAAATATT CACCGCCAGC
ACAATCTCCG GGGGAAGATG GCTTTTTACC CTCCTGCTGT TGCTAAACGG TGTGAACTTC
ATCCTATTAG GGATCCTGGG AGAGTATATC GGCAGAATTT ACGATGAAAC GAAAGACCGG
CCGCTATATA TTCTACGCAA CAAGCAGGAA GCAGAAAATT TATTAGTTCG AAGGGGGTAA
 
Protein sequence
MKSKVRYTII IPVYNEEDVI RETYRRLTLV MQSLGEPYEL LFVNDGSEDR TAEIIEVLAE 
TDDSVRLLNF SRNFGHQIAI TAGMDYARGD AIVIIDADLQ DPPELIPRMI EKWQEGYEVV
YARRVQRKGE TLFKKWTASL FYRTLRMMTE VDIPLDTGDF RLIDRKVCDV MHSIREKSRF
IRGLISWIGF RQAAIEYIRE ERFAGKTKYP LKKMLRLAID GITSFSHKPL KLATYLGLAL
SLPSFAYLVF SLGLKIFTAS TISGGRWLFT LLLLLNGVNF ILLGILGEYI GRIYDETKDR
PLYILRNKQE AENLLVRRG
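
The protein sequence above is the translation of the gene sequence. As a minimal sketch of that relationship, the code below translates the first 60 bases of the gene sequence. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as in the standard code (the two tables differ in their permitted start codons). The `translate` helper and the constant names are hypothetical and are not taken from the source database.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: amino-acid assignments of table 11 equal those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence (60 bases), copied from the record.
first_60 = "ATGAAAAGTA AAGTGCGTTA TACCATTATC ATTCCGGTCT ATAACGAAGA AGACGTTATT"
print(translate(first_60))  # MKSKVRYTIIIPVYNEEDVI
```

The output corresponds to the first 20 residues of the protein sequence. Passing the full gene sequence to the same function would be the natural way to check the remaining residues.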