Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3631 |
Symbol | |
ID | 8139005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4210385 |
End bp | 4211854 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644871252 |
Product | D-alanyl-D-alanine carboxypeptidase/D-alanyl-D-alanine-endopeptidase |
Protein accession | YP_003023410 |
Protein GI | 253702221 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2027] D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) |
TIGRFAM ID | [TIGR00666] D-alanyl-D-alanine carboxypeptidase, serine-type, PBP4 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 156 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTATC TTCGCAAGGC CATAGGGGCT GTGCTTTCAT TGCTGATTAT CGCCGCGGGT TCGCCTGTCG CGACCTTTGC CGGACAAACT GCTCCCCCTG CACGAACCGA ACTCGGCGTC ACCAGCAATT ACACCGCTGC CCTAAAAAAG GAGATCGACG CCATACTGGC CCGGGAGTTC CTGCCCGTCA CCAGCGCGGG TATCAAGGTG GTCTCGCTCA AGCGTGGCGA AACCATCTAC GAGTTCAACC CGCGGCTGCT CTTGGTTCCT GCGTCCACTC AAAAGGTCTT CACCGCGGCT GCTGCCTTGT CTATGCTGGG ACCGGACCGG GAGGTCGCTA CCACTGTTGC GCTCGACGCG GCCGGGACGA GGATCTACCT CAAAGGATGC GGCGACAGTC TGCTTTCCGC GGCTGACTTG ACCGCCCTGG CCGCGGCTGC GGCCCCCAAG CTGGACAAGG GGAGGGAGTA TAGCCTTTCT GCCGACCTTT CCTGCTTTGA TGACCTCTAC CGGGGCAAAG GGTGGATGTG GGACGACGAC GAGATGATGA TCTCTCCCCT GTCGGTCAAC CACAATGCCG TCTCACTGCT GGTGCAGCCT GGCGCCAAGG CGGGAGCCCC GGCCGTTATC ACCTCGGAGC CTCGCACCTC CTACTACACC GTTCAAAATC TGACCAGGAC CGGTAGTGCC AAGGATGAAA GCAGTATCCA GGCCTATAGG CGCCCCGGCG AGCGGGACAA CGTGGTCACG GTGACCGGAG TCATACCGTT GGGGAGCGCC CCTCTGGTTA AACAGGCCAG CGTTTGGCGG CCGGAGATGA TGGCGCTCAC CCTGTTCCGG GATGCGTTAC GGGCGCAGGG GATTAAGGTC GGCACCATGA CTACGGCACC CACTCCGGCG GGAGTAACCG AGGTGGCGCG CACAGCCCGC CGCGTGGAGG AGTTGGTCCG GTTCGCTTTG AAGACCAGCG ACAACGTGAC AGCCGAGAGT CTGCTTAAAC TGCTGGGTCT GCATGGGAGC GGAAAGCGGG GATCGGCGGA GGCGGGGAGC GTTGCGGTGC GCCGTTACCT GGAGAGGCAC GGAATAGCCA CTGATAACGT AGTGGTCGCA GATGGTTCGG GACTTTCGCG TTACAACCTT TCCAGCGCCG AGGCTATGAT CCAGACGTTG CAAGCTATTC ACCGCGACCC CGGGCTGTAC CGCATCTTTC AGGAATCCCT TCCTGTAGCA GGTATGGATG GCACGTTGAA GAACCGCATG AAGGGGAGCT GCGCCGAAGG GAACGTGAGG GGGAAAACCG GAAACATGAA AGGCGTCTCC GCCTTAGCCG GCTACGCCAC CAGCGCCGAC GGAGAACCGT TCGCCTTTTC CATCATCATC CAGAACTACG CCGCCACCGG AAAGCAGGCC CGTAAGGTAC AGGATCGGAT CGCGGCACTG CTTTGCAGTT TCAGGCGCAG CACGAAATAG
|
Protein sequence | MSYLRKAIGA VLSLLIIAAG SPVATFAGQT APPARTELGV TSNYTAALKK EIDAILAREF LPVTSAGIKV VSLKRGETIY EFNPRLLLVP ASTQKVFTAA AALSMLGPDR EVATTVALDA AGTRIYLKGC GDSLLSAADL TALAAAAAPK LDKGREYSLS ADLSCFDDLY RGKGWMWDDD EMMISPLSVN HNAVSLLVQP GAKAGAPAVI TSEPRTSYYT VQNLTRTGSA KDESSIQAYR RPGERDNVVT VTGVIPLGSA PLVKQASVWR PEMMALTLFR DALRAQGIKV GTMTTAPTPA GVTEVARTAR RVEELVRFAL KTSDNVTAES LLKLLGLHGS GKRGSAEAGS VAVRRYLERH GIATDNVVVA DGSGLSRYNL SSAEAMIQTL QAIHRDPGLY RIFQESLPVA GMDGTLKNRM KGSCAEGNVR GKTGNMKGVS ALAGYATSAD GEPFAFSIII QNYAATGKQA RKVQDRIAAL LCSFRRSTK
|
| |