Gene Moth_1364 details


Gene Information

Locus tag: Moth_1364
Symbol:
ID: 3832286
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1407172
End bp: 1408476
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 60%
IMG OID: 637829300
Product: diaminopimelate decarboxylase
Protein accession: YP_430220
Protein GI: 83590211
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0019] Diaminopimelate decarboxylase
TIGRFAM ID: [TIGR01048] diaminopimelate decarboxylase
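
The start, end, and length fields above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1407172
end_bp = 1408476

# Assumption: 1-based, inclusive coordinates, so both end points are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1305 bp) and Protein Length (434 aa).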


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.844058
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATAA ATGAACACGG ACGGCTGGTC ATCGGCGGCT GTGAGGTTGT CCAGCTGGCC 
CGTGACTTTG GCACCCCCCT TTATATTTTT GATGAAGACT GTATCCGGGA CAACTGCCGC
CAGTTTTATC GGGCCTTTAA CGCCGGCAGC GGCCAGGCCG AGGTTATCTA TGCCGGCAAG
GCTTTTCTTA CTACGGCCAT GTGCCGGATC ATCGCCAGTG AAGGGCTGGG ACTGGATGTT
GTCTCCGGCG GCGAACTTTA CACCGCCCTG GCGGCCGGTT TTCCCGTGGA ACGGATTTAT
TTTCATGGTA ACAACAAGAG CTATGCCGAG CTCTGCCAGG GCCTGGAGGC CGGAATAGGC
AGGTTCATGG TCGATAACTT TACAGAGCTG GAACTGTTAA GCCGCCTTGC AGTGGAGCGT
GGCCAGGTGG CCAAGGTAAT CCTGAGGGTC ACCCCGGGAA TTGAGGCCCA CACCCATGAC
TATATCCGCA CCGGCCAGGT TGATTCCAAG TTCGGTTTTA CCCTGCCGGG GGGCCAAGCA
CTAGCGGCTG CCAGGAGGGC CGGCGAGTTA CCGGGGATCG AGTTCATGGG TTTGCACTGC
CATATTGGCT CCCAGATTTT CGAACTGGAG CCCTACAATG AAGCCGTAGC CGTGATGATG
GAGCTGGCTG CTGCGGTCAA GGAGGCGACA GGGCTGGTAA CCGCCGAGTT GGATCTCGGG
GGCGGCTTTG GCATATACTA TACTACCGGT GACGAACCCC GGCCGATAAG GGCTTATGCG
GAAACCATCC TGGCCCGGGT CCGGGAGGAA GCCCGGCGCT TGAACCTGCC CCAACCGCGA
GTCCTGGTCG AACCGGGTCG GTCCATTGTC GGGCCGGCGG GCAGCACTGC CTACACCGTC
GGGAGCATCA AAGAGATCCC CGGCGTCCGC AAGTATGTGG CTGTTGACGG CGGGATGGCC
GATAACATTC GCCCGGCCCT TTACGGTGCG AAGTATGAGG CCATCCTGGC CAATAAGGCC
GGTATGCCGG CTACAGAGAA GGTAACCGTC ACCGGGAAGT GCTGCGAATC AGGGGATATG
CTGATCTGGG AAGCGGAATT GCCCCCGGTA GAGAGGGGCG ACATCCTCCT GATGCCCTGT
ACCGGGGCCT ATGGTTATAC CATGGCCAGC AATTATAACC GCCTGGGCCG GCCGGCAGCC
GTCCTGGTGC GGGATGGCGT CGCCGACTTG ATTATAAAGC GGGAGGATTA CAGCGACCTG
ATCCGTAACG ACGTGATCCC GGCCCGCCTG GCCTGCCCTC GCTAG
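
The GC content field can be recomputed from a sequence listed in this 10-base block layout. The sketch below assumes GC content means the percentage of G and C bases over all bases, and it strips the spaces and line breaks before counting. The helper name `gc_content` is hypothetical, and only the first 60-base line of the gene is used as sample input.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGGCGATAA ATGAACACGG ACGGCTGGTC ATCGGCGGCT GTGAGGTTGT CCAGCTGGCC"
print(f"{gc_content(first_line):.1f}% GC over the first 60 bases")
```

Passing the full 1305-base sequence instead of the first line gives the gene-wide figure to compare with the listed GC content.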
 
Protein sequence
MAINEHGRLV IGGCEVVQLA RDFGTPLYIF DEDCIRDNCR QFYRAFNAGS GQAEVIYAGK 
AFLTTAMCRI IASEGLGLDV VSGGELYTAL AAGFPVERIY FHGNNKSYAE LCQGLEAGIG
RFMVDNFTEL ELLSRLAVER GQVAKVILRV TPGIEAHTHD YIRTGQVDSK FGFTLPGGQA
LAAARRAGEL PGIEFMGLHC HIGSQIFELE PYNEAVAVMM ELAAAVKEAT GLVTAELDLG
GGFGIYYTTG DEPRPIRAYA ETILARVREE ARRLNLPQPR VLVEPGRSIV GPAGSTAYTV
GSIKEIPGVR KYVAVDGGMA DNIRPALYGA KYEAILANKA GMPATEKVTV TGKCCESGDM
LIWEAELPPV ERGDILLMPC TGAYGYTMAS NYNRLGRPAA VLVRDGVADL IIKREDYSDL
IRNDVIPARL ACPR
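
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own procedure. It builds the standard codon table, whose amino acid assignments are the same in translation table 11 (table 11 differs in which codons may act as start codons). This gene begins with ATG, so no alternative start codon handling is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored; trailing bases that do not fill a codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene, copied from this record.
print(translate("ATGGCGATAA ATGAACACGG ACGGCTGGTC"))
print(translate("GCCTGCCCTC GCTAG"))
```

The two fragments translate to the first ten and last four residues of the listed protein sequence (MAINEHGRLV and ACPR). Passing the whole gene sequence to `translate` gives the full protein for comparison.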