Gene GM21_3631 details


Gene Information

Locus tag: GM21_3631
Symbol:
ID: 8139005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4210385
End bp: 4211854
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 61%
IMG OID: 644871252
Product: D-alanyl-D-alanine carboxypeptidase/D-alanyl-D-alanine-endopeptidase
Protein accession: YP_003023410
Protein GI: 253702221
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2027] D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4)
TIGRFAM ID: [TIGR00666] D-alanyl-D-alanine carboxypeptidase, serine-type, PBP4 family
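
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the coordinates are 1-based and inclusive (the record does not state this, but it matches the reported length) and that the gene length includes the stop codon. The function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp):
    """Residues encoded by an unspliced CDS, assuming the length includes the stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4210385, 4211854)
print(length, protein_length_aa(length))  # 1470 489
```

The 1470 bp span gives 490 codons, one of which is the stop codon, leaving the 489 residues listed for the protein.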


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 156
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTATC TTCGCAAGGC CATAGGGGCT GTGCTTTCAT TGCTGATTAT CGCCGCGGGT 
TCGCCTGTCG CGACCTTTGC CGGACAAACT GCTCCCCCTG CACGAACCGA ACTCGGCGTC
ACCAGCAATT ACACCGCTGC CCTAAAAAAG GAGATCGACG CCATACTGGC CCGGGAGTTC
CTGCCCGTCA CCAGCGCGGG TATCAAGGTG GTCTCGCTCA AGCGTGGCGA AACCATCTAC
GAGTTCAACC CGCGGCTGCT CTTGGTTCCT GCGTCCACTC AAAAGGTCTT CACCGCGGCT
GCTGCCTTGT CTATGCTGGG ACCGGACCGG GAGGTCGCTA CCACTGTTGC GCTCGACGCG
GCCGGGACGA GGATCTACCT CAAAGGATGC GGCGACAGTC TGCTTTCCGC GGCTGACTTG
ACCGCCCTGG CCGCGGCTGC GGCCCCCAAG CTGGACAAGG GGAGGGAGTA TAGCCTTTCT
GCCGACCTTT CCTGCTTTGA TGACCTCTAC CGGGGCAAAG GGTGGATGTG GGACGACGAC
GAGATGATGA TCTCTCCCCT GTCGGTCAAC CACAATGCCG TCTCACTGCT GGTGCAGCCT
GGCGCCAAGG CGGGAGCCCC GGCCGTTATC ACCTCGGAGC CTCGCACCTC CTACTACACC
GTTCAAAATC TGACCAGGAC CGGTAGTGCC AAGGATGAAA GCAGTATCCA GGCCTATAGG
CGCCCCGGCG AGCGGGACAA CGTGGTCACG GTGACCGGAG TCATACCGTT GGGGAGCGCC
CCTCTGGTTA AACAGGCCAG CGTTTGGCGG CCGGAGATGA TGGCGCTCAC CCTGTTCCGG
GATGCGTTAC GGGCGCAGGG GATTAAGGTC GGCACCATGA CTACGGCACC CACTCCGGCG
GGAGTAACCG AGGTGGCGCG CACAGCCCGC CGCGTGGAGG AGTTGGTCCG GTTCGCTTTG
AAGACCAGCG ACAACGTGAC AGCCGAGAGT CTGCTTAAAC TGCTGGGTCT GCATGGGAGC
GGAAAGCGGG GATCGGCGGA GGCGGGGAGC GTTGCGGTGC GCCGTTACCT GGAGAGGCAC
GGAATAGCCA CTGATAACGT AGTGGTCGCA GATGGTTCGG GACTTTCGCG TTACAACCTT
TCCAGCGCCG AGGCTATGAT CCAGACGTTG CAAGCTATTC ACCGCGACCC CGGGCTGTAC
CGCATCTTTC AGGAATCCCT TCCTGTAGCA GGTATGGATG GCACGTTGAA GAACCGCATG
AAGGGGAGCT GCGCCGAAGG GAACGTGAGG GGGAAAACCG GAAACATGAA AGGCGTCTCC
GCCTTAGCCG GCTACGCCAC CAGCGCCGAC GGAGAACCGT TCGCCTTTTC CATCATCATC
CAGAACTACG CCGCCACCGG AAAGCAGGCC CGTAAGGTAC AGGATCGGAT CGCGGCACTG
CTTTGCAGTT TCAGGCGCAG CACGAAATAG
 
Protein sequence
MSYLRKAIGA VLSLLIIAAG SPVATFAGQT APPARTELGV TSNYTAALKK EIDAILAREF 
LPVTSAGIKV VSLKRGETIY EFNPRLLLVP ASTQKVFTAA AALSMLGPDR EVATTVALDA
AGTRIYLKGC GDSLLSAADL TALAAAAAPK LDKGREYSLS ADLSCFDDLY RGKGWMWDDD
EMMISPLSVN HNAVSLLVQP GAKAGAPAVI TSEPRTSYYT VQNLTRTGSA KDESSIQAYR
RPGERDNVVT VTGVIPLGSA PLVKQASVWR PEMMALTLFR DALRAQGIKV GTMTTAPTPA
GVTEVARTAR RVEELVRFAL KTSDNVTAES LLKLLGLHGS GKRGSAEAGS VAVRRYLERH
GIATDNVVVA DGSGLSRYNL SSAEAMIQTL QAIHRDPGLY RIFQESLPVA GMDGTLKNRM
KGSCAEGNVR GKTGNMKGVS ALAGYATSAD GEPFAFSIII QNYAATGKQA RKVQDRIAAL
LCSFRRSTK
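
The protein sequence above corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the sequence is pasted in the 10-base block layout used above, and it builds the codon table from the amino-acid string of NCBI table 11 (codons ordered with bases T, C, A, G). The names `clean` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, in codon order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks of the block layout and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon.

    Alternative start codons permitted by table 11 are not special-cased;
    this gene starts with ATG, so that does not matter here.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence
print(translate("ATGTCGTATC TTCGCAAGGC CATAGGGGCT"))  # MSYLRKAIGA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record; the first ten residues printed here match its first block.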