Gene Moth_0690 details


Gene Information

Locus tag: Moth_0690
Symbol:
ID: 3832514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 720787
End bp: 721917
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 50%
IMG OID: 637828623
Product: hypothetical protein
Protein accession: YP_429553
Protein GI: 83589544
COG category: [S] Function unknown
COG ID: [COG3287] Uncharacterized conserved protein
TIGRFAM ID:
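
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; both assumptions agree with the numbers in this record but are not stated by it. The function names are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(720787, 721917)
print(length)                     # 1131
print(protein_length_aa(length))  # 376
```

Both printed values match the Gene Length and Protein Length fields of this record.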


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 8.47515e-09
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCGGGTTG GGGTAGGTTT TAGCAGTGCA AATGACCCTA GCGCTGCTGG ACAGGTTGCT 
TCCGAGCAGG CTGTGCGGCA ATCCGGAAGT CCCGTAATAA CTTTGGTTCT GACCACTGAC
AATTACGATC AAGAGCGGGT CTTGTCGGCC GTCAAAAGAG TTATCGGTAA TTCCAGGCTG
GTAGGCGCCT GCGTTCCGGG GGTCATTGTC AACGCGAGGC TGTACAAGAG AGGGGTGGGT
ATCTGTACAG TCAGTGGAGA AGGTGTGGAG GCGGTAACAC ACCTGCAGAG GAATATTTCC
CAGCACTCGT ACAGAAAAGG CGAGAAAGCG GGCGAAGCTT TGCTGGAAAA GGGCGGTGAA
ACACCGGGGA CAGTATTACT CTTTCCAGAT GGTTTTGCAG CAAACATTTC TGGCCTGCTC
AGGGGATTAT ATAACGTTAT GGGGCCTGCC TTTGAATACA TTGGGGGCGG GAGTGGCGAC
AATCTGCGGT TTTACAGAAC TTATCAGTTT ACTGAGGAAG GTATCAGCAG CGATGCCGTA
GCGGCAGCAG TAATAAGGGG TATAAACTTT CAGATGTGCC TGAGTCATGG CTGGAGGCCG
GTAGGAGAAC CGCTCATGGT GACGAAAGCG AAAGGGAGAA AGGTTTATGA AATCGATGGA
CTTCCGGCAC TGGAGAGATA TTCGGCTCTG GTTGGCGCCT ACGACAAGAA CGATTTTTCC
TGCTACAGCA TGAAGTATCC TTTGGGCTTA CCCTGTGCGG GGGGAGAATT TATTATCCGC
GATCCACTCA AAGCCGAAGA AGATGGGGGC ATTTTATTCG TAACTGAAAT TCCTGAAAAC
ACTATCGCCA CTCTGATGGA AGGGGATACC GCAAGCCTTC TTGCGGCTGC GGAAGAAGTA
TCGAAAAAGG CGTTAAATAC GCCAGCTGCT CCCAAGACCT TTATGGTGTT TGATTGTGTT
TCCCGCTATT TATTGATGGG AGAGGACTTC TCTCGCGAAA TGGAAGCAAT AGCCAAAAAC
ATCAAAGCAG AAATTCCAGT TATAGGGATG CTATCCTTCG GCGAAATTAG CAGCATCTCA
GGGACACCGC TATTTTACAA CAAGACCATT GTAGCTGCCG CGGGGTGGTA G
 
Protein sequence
MRVGVGFSSA NDPSAAGQVA SEQAVRQSGS PVITLVLTTD NYDQERVLSA VKRVIGNSRL 
VGACVPGVIV NARLYKRGVG ICTVSGEGVE AVTHLQRNIS QHSYRKGEKA GEALLEKGGE
TPGTVLLFPD GFAANISGLL RGLYNVMGPA FEYIGGGSGD NLRFYRTYQF TEEGISSDAV
AAAVIRGINF QMCLSHGWRP VGEPLMVTKA KGRKVYEIDG LPALERYSAL VGAYDKNDFS
CYSMKYPLGL PCAGGEFIIR DPLKAEEDGG ILFVTEIPEN TIATLMEGDT ASLLAAAEEV
SKKALNTPAA PKTFMVFDCV SRYLLMGEDF SREMEAIAKN IKAEIPVIGM LSFGEISSIS
GTPLFYNKTI VAAAGW
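
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own method: it assumes the gene sequence is given 5' to 3' in the coding frame, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation (table 11 differs from the standard code only in its set of permitted start codons). The `translate` function and `CODON_TABLE` name are hypothetical helpers. Only the first and last blocks of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCGGGTTG GGGTAGGTTT TAGCAGTGCA"))  # MRVGVGFSSA
# Last 21 bases, which are in frame and end with the TAG stop codon.
print(translate("GTAGCTGCCG CGGGGTGGTA G"))           # VAAAGW
```

The two outputs match the start and end of the protein sequence listed above.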