Gene Moth_0453 details


Gene Information

Locus tag: Moth_0453
Symbol:
ID: 3830881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 454692
End bp: 456773
Gene Length: 2082 bp
Protein Length: 693 aa
Translation table: 11
GC content: 61%
IMG OID: 637828388
Product: LmbE-like protein
Protein accession: YP_429327
Protein GI: 83589318
COG category: [S] Function unknown
COG ID: [COG2120] Uncharacterized proteins, LmbE homologs
TIGRFAM ID:
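
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 454692
end_bp = 456773

# Assumption: coordinates are inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the listed Gene Length
print(protein_length, "aa")   # compare with the listed Protein Length
```

Under these assumptions the computed values agree with the listed 2082 bp and 693 aa.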


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGTCA GTGCCGTGAT CCCGGCTTAT AACGAAGAGA CCACCGTAGG CAGAATTATT 
GACACCCTGA AACAAGTAGC TGCAGTTACC GAAATCATCG TCGTCAGCGA CGGTTCCGAA
GATGATACCG CTGCCGTGGC CCGCCACCAC GGGGCCAGGG TACTGGAGCT GGCCGTAAAC
AGCGGCAAAG GGGCAGCCAT GACTGCCGGC GCCAGGGAGG CCAGGGAGGA CATCCTCCTC
TTCCTGGATG CCGACCTGGA GGGACTGCTG CCGGACCACG TCCAGGCCCT CATCGAGCCC
CTGCTGGCGG GCCGGGCCGA GATGAGCGTG GGCATCTTCA GCCGCGGCCG CTCCATGACT
GACCTGGCCC AGGTAGTGGC TCCCCACCTT TCCGGCCAGC GAGCCATCCG CAAAGATCTG
TTTTTAGCTA TCGGCGCCGA CAGGAGCCGT TTCGAGGTAG AGGTCCAGCT CACCAGCGAG
GCCAGAGCCC GGAATTGGCG GGTAGAGAAG GTACCCCTGG TCAACATGAC CCATATTATG
AAGGAAGAAA AAAGGGGCCT GTACCGGGGG GTAGTAGCCA GGATGGGTAT GTATAAAGAT
ATTGCCGGCT TTTTCTGGCG CCTGACCAGG AAGAAGTTAA AGGCGCGGCC GGTAGCCGTG
TTGCTGCTCC TGCTGTCGCT GGGGGTGACC TTTAACTACG ACACCCAGCG AGTGGCTTCC
GCGGAAGCCG GCAGGATGCC TGATTTAAAC CTGCCGGCAG CAGGACAGCG CCTGCTGGTC
GTTTCGCCCC ACCCTGATGA CGAGACCCTG GGCGCCGGCG GCTTGATTGC CAAGGCCAGG
GCCCGGGGGG ATACCGTGAA GGTAGTATTT ATGACCAACG GCGATGGCTT CCGCCGGGGG
GTAGAGACAA CCAGGGGCAT TTTGCCGACC AGTGCCGGTG ATTTTTTGAC TTACGGCGAG
AGACGCCAGC AGGAAGCCAT CACCGCGCTG GGGAACCTGG GGGTGGGGCC GGCGGATATT
ATCTTCATGG GTTACCCGGA CGGGGGGCTG GCCGCCATCT GGAGTAATTA CTGGCAGGAA
GACAAACCCT ACCGCTCGGC CTGCACCCGC AAGGAGGCCG TGCCCTATAG ACTGGCCTTT
AAACCGGGCG AACCTTATGC GGCCCCGGCC CTCCTCGCCG ACCTGGAGGA GATTCTCCGG
GAGTACCGGC CTACAGATAT TTATGTTACC GACACTAACG ACAGCCACCC CGACCACTGG
GCCACCGGGG CCTTCACCTT GGCGGCAGTG GGGGAGCTAA AGGGGGAAGA CCCTACCTTC
AACCCCCGTA TCTATACCTT TGTCATCCAT ACCGGCATGT GGCAAATGCT GCCGGTATTT
GACCGGGACC ATAAACCCCT CCTGCCCCCG GGGTATTTCC TGGCCCGGGG TACGCCCTGG
TATAAATTGC CTCTGGCGCC GGCAATCCTG GAACTGAAAA AACAGGCTAT CGCCGCTTAC
CGGACCCAGG AAATGGTCAT GCCCACTTTC CTGGCCAATT TTGAGCGGCC CAACGAGGTC
TTCTCCCGCC TGCCGGACCA GGAGGTGATT ACCACAGCGA CGGGCATGAG TGTCGACGGT
TGGGTTAAGG AATGGCCCCG GGATGCCGTC ATTGCCCTTG ACCCCGCCGG TGACCTGGTG
ACAAAAAAAG TAGAGCGGGG CGGCGATCTC AAGGCGGCCT ACCTGCTCCA GTCCGGCCGG
ACCACCTATT TGCGCCTGGA CACCTGGGGC CGGGTCGGTT TTCCGGTAAA TTACACCCTG
AGCATCTACC TGTTGCCGGC TTCTCCCGGG GCCGGTAGCC AGCGCTTTAC CTGGTCCTGG
GCACCCGGCG AGAAACAGGT CAGGTGGCTG ACCCGCCCGG CCGGTTACGA CCCGAATGCC
ATCCGGGTAG CTTCCGGAGG CGACAGCCTG GAGATGGCCC TGCCGGACCT TATTCCTCCC
GGCGAGCACT ACCTGATGTT CACCGCCGTC ACCTCTATCG GCAGGCTGCC CCTGGACCGG
ATCCCCTGGC GCCTCGTAAA GATTAAGGGA AGCGATTTAT AA
 
Protein sequence
MGVSAVIPAY NEETTVGRII DTLKQVAAVT EIIVVSDGSE DDTAAVARHH GARVLELAVN 
SGKGAAMTAG AREAREDILL FLDADLEGLL PDHVQALIEP LLAGRAEMSV GIFSRGRSMT
DLAQVVAPHL SGQRAIRKDL FLAIGADRSR FEVEVQLTSE ARARNWRVEK VPLVNMTHIM
KEEKRGLYRG VVARMGMYKD IAGFFWRLTR KKLKARPVAV LLLLLSLGVT FNYDTQRVAS
AEAGRMPDLN LPAAGQRLLV VSPHPDDETL GAGGLIAKAR ARGDTVKVVF MTNGDGFRRG
VETTRGILPT SAGDFLTYGE RRQQEAITAL GNLGVGPADI IFMGYPDGGL AAIWSNYWQE
DKPYRSACTR KEAVPYRLAF KPGEPYAAPA LLADLEEILR EYRPTDIYVT DTNDSHPDHW
ATGAFTLAAV GELKGEDPTF NPRIYTFVIH TGMWQMLPVF DRDHKPLLPP GYFLARGTPW
YKLPLAPAIL ELKKQAIAAY RTQEMVMPTF LANFERPNEV FSRLPDQEVI TTATGMSVDG
WVKEWPRDAV IALDPAGDLV TKKVERGGDL KAAYLLQSGR TTYLRLDTWG RVGFPVNYTL
SIYLLPASPG AGSQRFTWSW APGEKQVRWL TRPAGYDPNA IRVASGGDSL EMALPDLIPP
GEHYLMFTAV TSIGRLPLDR IPWRLVKIKG SDL
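
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the gene sequence relates to the protein sequence and to the listed GC content, the helpers below strip the block formatting, compute a GC fraction, and translate codons until the first stop. The function names are hypothetical. The codon table is built from the amino-acid string that NCBI lists for translation table 11, arranged in the conventional T, C, A, G base order. Alternative start codons are not handled.

```python
from itertools import product

# Codon table in NCBI's conventional TCAG ordering (third base varies fastest).
# The amino-acid string is the one listed for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate seq in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final blocks of the gene sequence above.
first_blocks = clean("ATGGGCGTCA GTGCCGTGAT CCCGGCTTAT")
last_blocks = clean("AGCGATTTAT AA")

print(translate(first_blocks))  # compare with the start of the protein sequence
print(translate(last_blocks))   # compare with the end of the protein sequence
```

Passing the whole gene sequence through `clean` and then `gc_fraction` or `translate` gives values that can be compared against the listed GC content and protein sequence.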