Gene GM21_2167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2167 
Symbol 
ID8137503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2529323 
End bp2530699 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content64% 
IMG OID644869782 
Producthydrolase, TatD family 
Protein accessionYP_003021977 
Protein GI253700788 
COG category[L] Replication, recombination and repair 
COG ID[COG0084] Mg-dependent DNase 
TIGRFAM ID[TIGR00010] hydrolase, TatD family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones134 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTGA TCGACAGCCA CGCACACATA TACGGCAAGG AGTACGCCGC CGATTTCGAG 
GAGATGATGG AGAGGGCCGC GGAGGCTGGA GTCCGCACCA TCGTGGCGGT GGGAGCGGAT
CTGGAGTCGA GCCAGGAAGC CCTTGCCCTT GCCGGGGCGC GCGAAAACGT CTACTGTTCG
GTCGGCATCC ATCCGCACGA CGCGGACCGG GTGACCGAGC GCTGCTACGA ACTGGTGCGC
GAGATGGCGC TCTCATGCCC CAAAGTGGTC GCCATCGGCG AAATCGGCCT CGACTTCTTC
AGGGACCGCT CCTCGCGCGA CAACCAGGAG GAGGTCTTCC GGCGCTTCAT CAGGATGGGG
CGCGAGCTCT CTCTGCCGCT CATCATTCAC GATCGGGACG CCCACGACAG GATCATGGCG
ATCCTCAAGG AGGAGAAAGC GGGCGAGGTG GGAGGCGTGC TGCACTGCTT CTCCGGCGAC
CTCGCCATGG CGCAGGAGTG CATCGAGCTC GGATTCAAGA TTTCCATCCC GGGGACGGTC
ACCTACCCCT CCAACGAGGC GCTCAGGGAA GTGGTGCGCG GGGTAAAGAT CGAGCAGCTC
ATGGTGGAGA CGGACGCTCC CTACCTGACG CCGGTGCCGC ACCGCGGCAA GAGGAACGAG
CCTGCCTTCG TGCGGCTCAC GGCCGAGCGG GTGGCGCAGG TCAAGGGGCT CTCGGCCGAG
GACGTCGGCA GGATCACCTC TTTTAACACC AGGAAGCTCT TCGGGATCCC GCAACCGGCC
GAGCAAGACA CCATTGCTTA CATGATCCGC AATTCGCTCT ACCTGAACGT CACCAACCGC
TGCTCGAACC GCTGCACCTT CTGCCCCAAG TTCGACGATT TCGCGGTGAA GGGTCACGAG
CTGAAGCTCT CCCACGAACC CAGTTTCGCC GAGGTGATAG CTGCGGTGGA CAGGGCCACC
GGTTTCGAAG AGGTCGTTTT CTGCGGCTAT GGCGAGCCGC TGGTCCGGCT CGACCTGGTG
AAGGAGGTGG CCGCCGAATT AAAGCGCCGC GGCATCAAGG TCCGAGTCAA CACGGACGGG
CAGGCGAACC TCGTGCACGG CAGGAACATC CTCCCCGAAC TCGCAGGCCT TGTGGACGTC
CTCTCGGTGA GCCTCAACGC GGCCAACGCC GAGGACTACC AGCGCTTGTG CAATACTCCC
TTCGGAGCGG CCGGCTTCCA GGGGGTGTGC GATTTTCTCA AGGAAGCGCC CAAGCACGTG
CCCCAGGTGA CGGCAAGCGC CGTGACGGTG CCCGGATTGG ACGTCGGGAA GGTGCGGGAA
CTGGCGCTGT CGCTGGGAGT GGATTACCGC GAGAGGGAAT ACGCGGAGGT AGGCTGA
 
Protein sequence
MELIDSHAHI YGKEYAADFE EMMERAAEAG VRTIVAVGAD LESSQEALAL AGARENVYCS 
VGIHPHDADR VTERCYELVR EMALSCPKVV AIGEIGLDFF RDRSSRDNQE EVFRRFIRMG
RELSLPLIIH DRDAHDRIMA ILKEEKAGEV GGVLHCFSGD LAMAQECIEL GFKISIPGTV
TYPSNEALRE VVRGVKIEQL MVETDAPYLT PVPHRGKRNE PAFVRLTAER VAQVKGLSAE
DVGRITSFNT RKLFGIPQPA EQDTIAYMIR NSLYLNVTNR CSNRCTFCPK FDDFAVKGHE
LKLSHEPSFA EVIAAVDRAT GFEEVVFCGY GEPLVRLDLV KEVAAELKRR GIKVRVNTDG
QANLVHGRNI LPELAGLVDV LSVSLNAANA EDYQRLCNTP FGAAGFQGVC DFLKEAPKHV
PQVTASAVTV PGLDVGKVRE LALSLGVDYR EREYAEVG