Gene Moth_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0041 
Symbol 
ID3830907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp41660 
End bp43069 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content68% 
IMG OID637827973 
Productarginine decarboxylase 
Protein accessionYP_428923 
Protein GI83588914 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1982] Arginine/lysine/ornithine decarboxylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000185697 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAGACCC AAGGGCAGGC GCCTCTATGG GAGGCGCTTT TAAACTATCG TCAGCAAGGA 
CTTAATTCCT GGCATACTCC CGGTCACAAG GACGGGGCCT ATACCTTGCC CCTGTGGCGG
GATTTCCTGG GCGCGGCCCT GGCTCTGGAC CTGACTGAGG TACCTGGCCT GGATAACCTG
GCCTGTCCCG GGGGGGCTAT CGCCGCCGCC CAGGAGCGGG CGGCCCGCTT TTATGGCGCA
GCCCGGACTT TTTTCCTGGT AAACGGTGCG AGCGCCGGCC TGATGGCCGT CATCCTGGCC
ACCTGCCGCC CCGGGGATGA AGTTTTGCTG CCCCGGTACG CCCACCGGGC GGTTTTTAAT
GGCCTGATTT TAAGTGGGGC CCGGCCGGTT TACCTGGCCA CGGAATGGCT GGCCAGCCCG
GGCCTGCCCC TGGGGGTAGC CCCGGAAAGC CTGGCCGGAA CCTTAAGGGA ACATCCCGGG
GCCAGGCTCC TGCTCCTGGT CCATCCTACC TATGAAGGTG TAGTACCCCG GAGCGAAGAA
CTAATTGCCC TGGCCCACGC CCACGGCGTG GCGGTCCTGG CCGATGCCGC CCATGGGGCC
CACTTTGGCC TGGCTCCGGG CCTGCCACCG TCTCCCCTGG ACCTGGGGGC TGATTTTGTC
GTCCAGAGTA GCCATAAAAC CCTGGCGGCC CTGACCCAGG CAGCCATGCT GCACCTGCGG
GAGGAGGCAG CGGCAGCAAG GGTGGCGGCA GCCCTGAACC TGCTCCAGAC TACCAGTCCT
TCCTACCTCC TGCTGGCTTC CCTGGATACG GCCCGTCTCC TGGCGGAAGA ACGGGGCCGG
CAGGACTGGG GTCTGACGGT AGCCCGGGCT ACCGCGGCCC GGGCCAGGCT GGATCGGGCC
GGACTACCGC CCCTGGCAAT GGCAGATGTT ACCGGGCCGG CGGCCAGCGG CCTGGATGTA
ACCCGCCTGC TCCTGCCTAC GGCGCCCCTG GGCCGGAGGG GAACGGAGGT GGCTGCCACC
CTGCGCCGCG CCGGCCAGGA GGTAGAACTG GCGGGAGAGG ATTATATCGT GGTAATCATC
ACCCCCGGCG ACGGTGAGGA GAAAATCGAG GCCCTGGTGA CCGCCCTGCT GGCCCTGCCG
CGATCCGACA GTAGGCAGCC GGCAGCTTTA GCCCCGGCAC CGCCCGTCAG GATACCCGAG
ACGGCTTTGA CGCCCCGGGA GGCCTGGCTT GCCCCCCGGC GGGAGTTGCC CCTGGGGGAG
GCTACGGGGA AGATTGCCGC CGAACTCATC TCCCCCTGCC CGCCGGGCCT GGCCCTGACG
GTACCGGGGG AGGTTCTGAC GCCGGAGGTC CTCGAGCGGC TCAGGGATTT ACGGGGACCG
TCTGGCCGGG TGCTGGTAGT GGATGGCTAA
 
Protein sequence
MKTQGQAPLW EALLNYRQQG LNSWHTPGHK DGAYTLPLWR DFLGAALALD LTEVPGLDNL 
ACPGGAIAAA QERAARFYGA ARTFFLVNGA SAGLMAVILA TCRPGDEVLL PRYAHRAVFN
GLILSGARPV YLATEWLASP GLPLGVAPES LAGTLREHPG ARLLLLVHPT YEGVVPRSEE
LIALAHAHGV AVLADAAHGA HFGLAPGLPP SPLDLGADFV VQSSHKTLAA LTQAAMLHLR
EEAAAARVAA ALNLLQTTSP SYLLLASLDT ARLLAEERGR QDWGLTVARA TAARARLDRA
GLPPLAMADV TGPAASGLDV TRLLLPTAPL GRRGTEVAAT LRRAGQEVEL AGEDYIVVII
TPGDGEEKIE ALVTALLALP RSDSRQPAAL APAPPVRIPE TALTPREAWL APRRELPLGE
ATGKIAAELI SPCPPGLALT VPGEVLTPEV LERLRDLRGP SGRVLVVDG