Gene Moth_1387 details


Gene Information

Locus tag: Moth_1387
Symbol:
ID: 3831634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1434438
End bp: 1435763
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 52%
IMG OID: 637829323
Product: radical SAM family protein
Protein accession: YP_430243
Protein GI: 83590234
COG category: [R] General function prediction only
COG ID: [COG2108] Uncharacterized conserved protein related to pyruvate formate-lyase activating enzyme
TIGRFAM ID:
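
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one stop codon (the record states neither explicitly). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1434438, 1435763)
print(length, protein_length_aa(length))
```

Under those assumptions the sketch prints 1326 and 441, which agrees with the Gene Length and Protein Length fields.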


Plasmid Coverage information

Num covering plasmid clones: 60
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTTATCG AGATCAGCAG GGAGACGCTG GCGGGCATCA GAAACCCCGA CCTGGCGAAG 
TACGCGGGGA TGTACACGAG GATCTACGAG GATTTTATGC GGCAAATCCA GGGGAGTGGC
ATAGCGGTCG CCCGGGAGGA TTACCGGGAA GAGACACGGC AGCGAATAGA GGGTTTGCGC
CGGCAGGGCG TCGTAGTCCG CAACGACGCC AAGAGCCTGT ATATCAACGG TATTTCCCCG
GCGTGCCTGG CCTGCCAGAA GGGCGTGGGG AGCCTGACCT TTTTCATCTC CCTTCAGTGC
CACCGTCACT GCTTTTTCTG TTTTAACCCG AACCAGGAAG GCTACGAGTA TTACACCCAT
AATCAGAGGG ATTGCCTGGC AGAATTGGAG TATCTCCAAA GAACCGGCCA GGAAATGAAA
CACGTGGCTC TGACGGGGGG CGAACCTCTC CTTCATCCAG AAGAAACCCT GGCTTTTTTC
CGCGCCGCCA AAGAAAAATT TCCCGGCGTT TATACCCGCC TCTATACGGC CGGTGATCTG
GCCGGCAAAG AGATGCTGGC GGAATTGCAA AGAACTGGCC TGGACGAGAT ACGCTTCAGC
ATCAGGCTGC ATGACCCGGA AGGGGTGCGG CGGCGCACCT ACGAGCATAT TGCCTTAGCC
AGGGAGTATA TTCCCCGGGT GATGGTGGAA ATGCCCGTCC TGCCCGGTAC CAGGAAACCT
ATGCAGGAAG TATTACTGGA ACTGGATCGC CTAGGCATTT TTGGCATAAA TTTGCTGGAG
TTCTGCTTTC CTTTCAATAA TGTGGATATA TATAACGAAA GGGGGTATAA AATCAAGAAT
CCACCCTATC GGGTGCTTTA CAATTACTGG TACGGCGGGG GCCTGCCGGT AGCCGGGAGC
GAGCTGGATT GCCTGGAGCT GATAGACTTT GCCCTGGAGA AGGGGTTGCA GCTGGGCATT
CACTATTGCT CCCTGGAAAA TAAAAATACC GGCCAGATTT ACCAGCAAAA CTACGGGCAG
AAAGTAGATG CCTTCCTGTA TTTCTCACCA CGGGATTACT TCTACAAATC GGCCAAGGTA
TTTGGAGACG ACATTCCCCG GGTGCTGGAA GTATTTAAGA AAATTAATTA CCACCAGTAT
ACCCTCAATA AGCAATACCA TTTCCTTGAA TTTCATATCA GCAAGGTTAA AGAGCTGGCG
GGACTCGACA TTGAGGTGGG AATTTCGACC AGCGTAATGG AGAAACGCCA GGATGGTAGC
TACCTGCGGA AATTGAAGGT CGAACTGACG CGCCCGGAAA TATTTGATGC GGAAACCGAT
ATTTGA
 
Protein sequence
MLIEISRETL AGIRNPDLAK YAGMYTRIYE DFMRQIQGSG IAVAREDYRE ETRQRIEGLR 
RQGVVVRNDA KSLYINGISP ACLACQKGVG SLTFFISLQC HRHCFFCFNP NQEGYEYYTH
NQRDCLAELE YLQRTGQEMK HVALTGGEPL LHPEETLAFF RAAKEKFPGV YTRLYTAGDL
AGKEMLAELQ RTGLDEIRFS IRLHDPEGVR RRTYEHIALA REYIPRVMVE MPVLPGTRKP
MQEVLLELDR LGIFGINLLE FCFPFNNVDI YNERGYKIKN PPYRVLYNYW YGGGLPVAGS
ELDCLELIDF ALEKGLQLGI HYCSLENKNT GQIYQQNYGQ KVDAFLYFSP RDYFYKSAKV
FGDDIPRVLE VFKKINYHQY TLNKQYHFLE FHISKVKELA GLDIEVGIST SVMEKRQDGS
YLRKLKVELT RPEIFDAETD I
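
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11, GTG can serve as a start codon and is then read as methionine. The sketch below illustrates that relationship. It assumes the amino acid assignments of the standard genetic code (which table 11 shares) and treats only ATG, GTG and TTG as start codons, a simplification of the full start-codon list for table 11. `translate_cds` is a hypothetical helper, not the method used to produce this record.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G
# and the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplifying assumption: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace is ignored, translation stops at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiating GTG or TTG is read as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCTTATCG AGATCAGCAG GGAGACGCTG"))
```

Applied to the first 30 bases, the sketch yields MLIEISRETL, the first ten residues of the protein sequence listed above.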