Gene Moth_1894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1894 
Symbol 
ID3831167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1958467 
End bp1960245 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content60% 
IMG OID637829827 
ProductGTP-binding protein TypA 
Protein accessionYP_430737 
Protein GI83590728 
COG category[T] Signal transduction mechanisms 
COG ID[COG1217] Predicted membrane GTPase involved in stress response 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR01394] GTP-binding protein TypA/BipA 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00474713 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAGC AAAAGATACG CAACCTGGCG ATAATCGCCC ACGTCGATCA TGGCAAGACG 
ACCCTGGTCG ACGGCATGCT GAAACAGAGC GGCATTTTCC ACGAGAAGCA GGTGGTCCAG
GAGCGTATCC TGGACCGCAA CGATCTGGAA CGGGAACGCG GCATCACTAT CATGGCCAAG
AATACCGCTG TCTTTTACCG GGGTTACAAG CTGAACATCG TCGACACCCC CGGCCACGCC
GATTTCGGCG GCGAAGTGGA GCGCATCGTC CAGATGGTGG ACGGGGCCCT CCTACTGGTG
GACGCCTTCG AGGGCCCCAT GCCCCAGACT CGTTTTGTCC TGAAAAAGGC CCTGGCGGTG
GGTCTGAAAC CCATTGTGGT CATCAATAAA ATGGACCGGC CCAACGCCCG GCCGGGGGCG
GTGGTCGACG AAGTCCTGGA CCTTTTCATC GACCTGGGAG CCACCGAGGA GCAGCTGGAT
TTCCCGGTGG TCTACACAGT AGCCCGCCAG GGAACGGCCA GCCTGGACCC GGACCAGCCC
GGGAAAGACC TGCAGCCTCT GTTTGACATA ATCGTCCAGC ATATACCAGC CCCCGGCGGG
GACCCGGAGG CAACCCTGCA GGTAGGAGTC AACCTCATTG ATTATGACAC TTATGTCGGC
CGCCAGGCCA TCGGCCGGGT ATATAACGGC ACCATCCGCG CCCGGCAGGA AGTGGCCGTT
GCCCGGCCCG ACGGCAGCCT GGTCCGGGGG CATGTGGCTG CCCTGCATGT TTTTGAAGGT
CTCAACAAGG TGCCGGTGGA TGAAGCCGCC GCCGGGGAGA TCGTCGTCGT CAGCGGCCTG
GAGGACATCA ACGTCGCCAA TACCATTACC TCGCCGGAGG ACCCCCGGCC CCTGGACTTT
GTCCGTATCG ACGAGCCCAC GGTGGCTATG ACTTTCATGG TCAATAAGAG CCCCTTCGCC
GGCCGGGAAG GGGAGTATGT AACTTCCCGG AAGCTCCGGG AGCGCCTCCT CAGGGAGGCG
GAATCGGATG TCAGCCTGCG GGTGGAGGAA ACGGATTCCC CTGACGCCCT GCTGGTTTCC
GGCCGGGGCG AGCTGCACCT CGCCATCCTC ATCGAAACCA TGCGCCGGGA GGGGTATGAA
TTCGAAGTCT CCCGGCCCCA GGCGATTATT AAAGAAATAA AGGGCGTCAA GTGCGAGCCC
GTTGAAGAAC TCATTATAGA GGTTCCGGAA ACCTATATGG GCATCGTCAT CGAGCGCCTG
GGTCCCCGCA AAAGCGAGAT GGTCAACCTG GAAAACAAGG GGGACGGCCA GGTGCGCCTG
ACCTTTCATA TCCCCACCCG GGGGCTCTTC GGCTTCCGTT CCGAATTCCT TACCGATACT
AAAGGTCTGG GCATCATGCA CCACGCCTTT CACCATTACG CCCCCTATGC CGGGGAGATT
GCTACCCGGA CGCGTGGTTC TCTGGTGGCC TTTGAAACCG GGGAAACAAC CAGCTATGGC
CTGGAGAACG CCCAGGAGCG GGGCGAGCTC TTTGTCGGCC CTGGGGTACC AGTCTACCGG
GGGATGATTG TCGGCGAGCA TTCCCGGCCC GGCGACCTGA TGATCAACGT CTGCAAAAAA
AAGCAACTGA CCAACGTCCG CAGTTCTACC GCCGATATTG CTATCAAACT GGTCCCGCCC
CGGGAGATGA CCCTGGAGCA GTGCCTGGAA TTTATCGCTG CCGACGAACT CCTGGAAGTG
ACGCCCAGGT CCCTCAGGAT GCGAAAGAGG GATATATAA
 
Protein sequence
MDQQKIRNLA IIAHVDHGKT TLVDGMLKQS GIFHEKQVVQ ERILDRNDLE RERGITIMAK 
NTAVFYRGYK LNIVDTPGHA DFGGEVERIV QMVDGALLLV DAFEGPMPQT RFVLKKALAV
GLKPIVVINK MDRPNARPGA VVDEVLDLFI DLGATEEQLD FPVVYTVARQ GTASLDPDQP
GKDLQPLFDI IVQHIPAPGG DPEATLQVGV NLIDYDTYVG RQAIGRVYNG TIRARQEVAV
ARPDGSLVRG HVAALHVFEG LNKVPVDEAA AGEIVVVSGL EDINVANTIT SPEDPRPLDF
VRIDEPTVAM TFMVNKSPFA GREGEYVTSR KLRERLLREA ESDVSLRVEE TDSPDALLVS
GRGELHLAIL IETMRREGYE FEVSRPQAII KEIKGVKCEP VEELIIEVPE TYMGIVIERL
GPRKSEMVNL ENKGDGQVRL TFHIPTRGLF GFRSEFLTDT KGLGIMHHAF HHYAPYAGEI
ATRTRGSLVA FETGETTSYG LENAQERGEL FVGPGVPVYR GMIVGEHSRP GDLMINVCKK
KQLTNVRSST ADIAIKLVPP REMTLEQCLE FIAADELLEV TPRSLRMRKR DI