Gene Moth_2519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2519 
Symbol 
ID3832574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2625427 
End bp2626815 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content55% 
IMG OID637830442 
ProducttRNA modification GTPase TrmE 
Protein accessionYP_431344 
Protein GI83591335 
COG category[R] General function prediction only 
COG ID[COG0486] Predicted GTPase 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00450] tRNA modification GTPase TrmE 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00190608 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCTTGACG ATACCATTGC CGCCCTGGCG ACACCGCCTG GTGAAGGCGG TATTAGCATA 
ATTCGCCTGA GCGGCAGCCA GGCCATTGCT ATAGTAGCAA AGGTTTTTAA ACCGGTCAAG
GGACCTGATT TAACCACAAC CAGGAGTCAT ACCCTGCGGT TGGGATTCAT AATTGATCCT
GTTTCTGGGG AAAGTCTGGA CGAGGTTCTG GTTAGTGTCA TGCGGGCTCC TCATAGTTAT
ACGGCTGAGG ATGTGGTGGA GATAAACTGC CACGGGGGCG CCCTGGCCAC GTCCAGGGTG
CTGCAACTGG TCCTGAGAAC CGGTGCCAGG TTGGCCGAGC CGGGAGAGTT CACTCGCCGG
GCTTTCCTCA ACGGCCGCCT GGACCTGGCC CAGGCCGAAG CGGTCCTGGA GATTATCCGC
GCCAGGAGCA GCAGGGGTTT GACGGCAGCC CTGGATCACC TACGGGGTAA CCTCTCCCGG
AAGATTGGCG AACTGAATGA ACGCTTGACC GGCATCCTGG CGGCCCTGGA AGCCAGTATG
GATTTTCCTG AGGAGGTCGG CGAGGTAGAC CCGGAGAATC TAGCTGACCT GCGCCGCATC
CTGGCGGGAG TTGACAGACT CCTGGCTACC TGGGAAGAAG GCCGACTTTT AACTGAAGGC
TTAAAAGTAG CTATTGTCGG CCGACCCAAT GTCGGCAAAT CAAGCCTGTT AAACGCCCTG
TTAAACCAGG AACGAGCCAT TGTCAGCAAC ATCCCAGGTA CCACCAGGGA TACCATTGAG
GAAACCCTGC AACTCGGGGG ATTTACCTGC CGCTTGATAG ATACAGCCGG GCTGCGGGAG
ACAGCGGATG AATTGGAGAG CATCGGCGTA GCCAGAAGTA AGAAGGCCAT TGCAGCGGCT
GACCTGGTGC TGGTGGTTGT TGACCTGCAA ACAGGAATCC AGGATGAAGA CCGGCGCGTT
TTGGAGAGTG TCAGGGATAA GGTTTTGATA ATCATAGGCA ACAAGCTGGA TCTTGTAGCC
CACGATATAA ATAAAAAATT GGCTGACCTC GAATCCTTTG CCGGAAATTA TCCCCGGGTA
GCTGTTTCCG CCCTCAAAGG TAAAGGATTA GATGAACTGG CCAGAAAAGT CCAGGAGATT
GTCCTGGGTG GAAGAGCCCT GGCAGGTAGC GATGAACCCT TAATCACCAA TGCCCGTCAC
CGGGCTGCCC TGGAAAATTG CCGGGAGCAC CTGGCCAGCG CCATTAAAGC CTGGGAAGAA
GGATTACCTG AGGATTTAAT CGCCATTGAC CTCTGGTCAG CAGCAGATTA CCTGGGAGAA
ATCATCGGAA CCACTGCCCG GGAGGATCTT CTGGACCGGA TATTCAGCGA TTTCTGCATC
GGCAAGTAA
 
Protein sequence
MLDDTIAALA TPPGEGGISI IRLSGSQAIA IVAKVFKPVK GPDLTTTRSH TLRLGFIIDP 
VSGESLDEVL VSVMRAPHSY TAEDVVEINC HGGALATSRV LQLVLRTGAR LAEPGEFTRR
AFLNGRLDLA QAEAVLEIIR ARSSRGLTAA LDHLRGNLSR KIGELNERLT GILAALEASM
DFPEEVGEVD PENLADLRRI LAGVDRLLAT WEEGRLLTEG LKVAIVGRPN VGKSSLLNAL
LNQERAIVSN IPGTTRDTIE ETLQLGGFTC RLIDTAGLRE TADELESIGV ARSKKAIAAA
DLVLVVVDLQ TGIQDEDRRV LESVRDKVLI IIGNKLDLVA HDINKKLADL ESFAGNYPRV
AVSALKGKGL DELARKVQEI VLGGRALAGS DEPLITNARH RAALENCREH LASAIKAWEE
GLPEDLIAID LWSAADYLGE IIGTTAREDL LDRIFSDFCI GK