Gene Moth_1646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1646 
Symbol 
ID3830934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1681626 
End bp1682831 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content57% 
IMG OID637829571 
Productpeptidase 
Protein accessionYP_430491 
Protein GI83590482 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR03320] M20/DapE family protein YgeY
[TIGR03526] putative selenium metabolism hydrolase 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.017723 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATG CTGTCATTGC CCAGATTAAG TCCGAGGTGG CCAAATATCG GGGAGACATA 
ATCCGGTTCC TGAAGGACAT AGTCGCCATC CCCAGCCCCA ACGGGGATAT CAAGGCCGTA
GCCGAGCGCA TCGGCCAGGA GATGAGAAAA CTGGGTTTCG ACGACGTCTT CCTGGACAGC
ATGGGCAATA TCGTCGGCCG CATCGGCAGC GGCCCCAGGG TACTCCTGTA TGACAGCCAT
ATTGACACCG TGGATATCGC CGATTCCGAT CAGTGGCAGT GGGACCCCTA CAAGGGCAAG
GAAGAGAATG GTATTTTCTA CGGTCTGGGG GCCGGGGATG AGAAGAATTC CACTCCGGGG
ATGGTTTACG GACTGAAGAT CATCAAGGAC CTGGGCCTGG CCGATGACTT TACCCTTTAC
TATTTCGGTA ATATCGAGGA GATCTGCGAC GGAGTGGCGC CCAACTCCCT GGTGGTCACC
GATAAGATCA AACCCGACTT TGTTGTTATC GGTGAGCCTA CCAAAATGAA CATCTACCGG
GGTCACCGCG GCCGGGTGGA GATGAAGGTT ACCACCAAAG GCCGGACTTG CCACGCCAGC
GCCCCGGAGC GCGGGGTCAA CGCCGTTTAC AAAATGGCGG AAATTATCAA GGGCATTAGC
CAGATGGGCG CCGACTTCGT TGAGGACCCC TTCCTGGGAA AGGGGTCTAT AGCCGTCACC
GACATCCACT GCAAAACGCC CTCCATCAAT GCCTTACCCG ACGAGTGCGT GATTTACATT
GACCGCCGCC TGACCTTCGG TGAGACCCAG GAGATGGCCG TCGAGCAGGT GCGTAAAGTA
GCCGAGCCCC ACGGCGGCAA GGTCGAGGTG CTGGAGTTTG ACGAGCCCAG CTATACCGGC
TTTGTCTTCA AAGTCGACAA ATACTTCCCG GCCTGGGTCC TGCCTGAGGA TCACCTCCTG
GTCAAGGCCG GCCTGGAAAC CTATCAACGG GTTTTCGGCC AGCCCACCGG GGTGGGGAAA
TGGGTCTTCA GCACCAACGG TATTTACTGG ATGGGTAAAG CCGGCATTCC CGCCATCGGC
TTTGGCCCCG GCGACGAGGT CTACGCCCAC AGCGTCCTCG ACCAGGTGCC CATCGAAGAC
GTCGTCCGTT CCACCGAGTT CTACGCCTAC TTCCCCACGG TTTTAAGGGA AATGCTGGCC
AGATAA
 
Protein sequence
MSDAVIAQIK SEVAKYRGDI IRFLKDIVAI PSPNGDIKAV AERIGQEMRK LGFDDVFLDS 
MGNIVGRIGS GPRVLLYDSH IDTVDIADSD QWQWDPYKGK EENGIFYGLG AGDEKNSTPG
MVYGLKIIKD LGLADDFTLY YFGNIEEICD GVAPNSLVVT DKIKPDFVVI GEPTKMNIYR
GHRGRVEMKV TTKGRTCHAS APERGVNAVY KMAEIIKGIS QMGADFVEDP FLGKGSIAVT
DIHCKTPSIN ALPDECVIYI DRRLTFGETQ EMAVEQVRKV AEPHGGKVEV LEFDEPSYTG
FVFKVDKYFP AWVLPEDHLL VKAGLETYQR VFGQPTGVGK WVFSTNGIYW MGKAGIPAIG
FGPGDEVYAH SVLDQVPIED VVRSTEFYAY FPTVLREMLA R