Gene Moth_1499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1499 
Symbol 
ID3831726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1544289 
End bp1545527 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content60% 
IMG OID637829431 
ProductSerine-type D-Ala-D-Ala carboxypeptidase 
Protein accessionYP_430351 
Protein GI83590342 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGAAAC TTACAGCGAT ACTGCTGGCC TTACTCCTGG CCCTCGGAAT GGGGCCGGGA 
TTCTTGCTGG CGGCTGATCC GGCGGATGGG GCCGGAAACG AAGCCGCGCC GGTCACCGGG
CAGCCAGGTG AAGTCCAGGC AACGGCAGGC GCGGCAGCCG GGCCGGGAGA TATCAATGCC
AAGGGCGCCA TCCTCATGGA CCCCCAGACA GGGGAGATCC TCTGGGAGAA GAACTCCCAT
GCCCATTATT ATCCGGCCAG CATGACCAAA CTCATGACCA TGGTCGTGGC TATGGACATG
GTCGCCAACG GTCAGGCCAC CCTGGATGAA CCGGTCCAGG TGAGCGAACG GGCCGAAAGC
TTTGGTGGGT CGGAGGTCTT CCTGGCTGTA GGCGAAACCT TTCCCCTGGA GCAGATGCTC
ATCGCTATTG CCGTGGCCTC GGCCAACGAC GCGGCCGTAG CGGTGGCCGA GCATCTGGCG
GGTTCGGAGG AAGCCTTCGT GGCCATGATG AACGCCAAGG CCAAGGAACT GGGCCTCAAG
GATACCCACT TCGCCAACTG TCACGGCCTC CATGACGAAC AGAATTATAC CTCAGCCTAT
GATATGGCCG TAATTGCCCG TTACGCCCTG AAGTACCCCA AAATCCGCGA GTGGACCTCC
ATTAAGCGCT ATACCCTGCG TAAAGATCCC CTGACCATCC TGGATACCAC CAACAAGATG
CTCTATTGGT ACCCGGGAAC GGACGGGTTT AAGACCGGCT TCACCGATGC TGCCGGCTTG
AACCTTGTTT CAACGGTGGA GAGGGACGGT TTGCGGTTGG TCGCCGTTGT CATGGGCGTC
GAGACGCCCC AGGGGCATTT TACCGAATCT ATGAAGCTGT ACAACTGGGC CTTTAAACAG
TGGGCCTTCA AGGAGTTTTA CGGCCCGGGC CAGGTAGTGG CCAGCATCCC GGTGGGCAAG
GGCCAGGTGG AGCAGGTAAA GATTGTTACG ACCGGGAAGG TCGGTGCCCG GATAAGCCGC
CTCCGGGGTA AGGCTGAGGG CGTGACAACC AAAGTCGAAC TGCCGGGTAT TGTCAACGCC
CCGGTGAAGG AGGGGCAGGT TGTCGGCCAG GCGCTGGTCC TAAGGGACGG GCAGGTGATC
GATAGAGTAC AGCTGGTGAC GCAACAAAAG GTGGCAAAAG CCTCCCTGGG CCAGGAGATT
GTCCGGGTCA TCCGAGCCGT TTTCACTATT CGACAATAA
 
Protein sequence
MQKLTAILLA LLLALGMGPG FLLAADPADG AGNEAAPVTG QPGEVQATAG AAAGPGDINA 
KGAILMDPQT GEILWEKNSH AHYYPASMTK LMTMVVAMDM VANGQATLDE PVQVSERAES
FGGSEVFLAV GETFPLEQML IAIAVASAND AAVAVAEHLA GSEEAFVAMM NAKAKELGLK
DTHFANCHGL HDEQNYTSAY DMAVIARYAL KYPKIREWTS IKRYTLRKDP LTILDTTNKM
LYWYPGTDGF KTGFTDAAGL NLVSTVERDG LRLVAVVMGV ETPQGHFTES MKLYNWAFKQ
WAFKEFYGPG QVVASIPVGK GQVEQVKIVT TGKVGARISR LRGKAEGVTT KVELPGIVNA
PVKEGQVVGQ ALVLRDGQVI DRVQLVTQQK VAKASLGQEI VRVIRAVFTI RQ