Gene GM21_1934 details


Gene Information

Locus tag: GM21_1934
Symbol:
ID: 8137268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2245005
End bp: 2246033
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 67%
IMG OID: 644869548
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_003021745
Protein GI: 253700556
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
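
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the stop codon. The record states neither, but both assumptions reproduce its numbers. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2245005
end_bp = 2246033

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1029
print(protein_length_aa)   # 342
```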


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 101
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTAGTTC TTGCTTTGGA ATCATCCTGC GACGAAACGG CAGCCGCGGT GGTCAAGGAC 
GGCCGCACCG TCCTCTCCAG CATCGTCGCC TCCCAGATCA GCGTCCACGC CGAATACGGC
GGCGTGGTCC CCGAAATCGC ATCCCGAAAG CACCTGGAGT CCGTCTCCTT CGTGGTGGAA
CAGGCGTTAG CGGAGGCCGG CGTCGGTCTC GACCGGATCG ATGGGATCGC CGTGACCCAG
GGGCCCGGCC TTGCCGGGGC GCTCCTGGTG GGGATCTCCG TCGCCAAGGG GCTCGCCTTC
GGCCGTTCGC TCCCGCTCGT CGGGGTGAAC CACATCGAGG GGCACCTTTT GGCCGTCTTC
CTGGAGGCGC CGGTGCAGTT TCCCTTCATC GCGCTCGCCG TCTCCGGGGG GCACTCGCAC
CTGTACCGCG TGGACGGGAT CGGACGCTAC CAGACTCTGG GGCAGACGGT CGACGACGCC
GCAGGCGAAG CCTTCGACAA GGTGGCGAAG CTGATCGGGC TCCCTTACCC GGGGGGCGTG
GCGATAGACC GGCTCGCCGT CTCGGGTGAC CCTAAGGCCA TCAAGTTCCC GCGCCCGCTT
CTGCACGACG GCACCTTCAA CTTCAGCTTC TCGGGGTTGA AGACCGCGGT GCTGACCCAC
GTCGGCAAGC ATCCGGAGGC GAAGGAGGCC GGGATCAACG ATCTCGCCGC CTCGTTCCAG
GCCGCGGTCT GCGAGGTGCT CACCAAGAAA ACGGCGGCCG CCGTCGCCGC AACCGGGATC
AAAAGGCTGG TCGTGGCCGG AGGTGTCGCC TGCAACAGCG CGCTGCGCCG CTCCATGGCC
GAGTATGCCG CGGCGAACGG GGTGGAACTT TCCATTCCCT CGCCCGCCCT TTGCGCCGAC
AACGCCGCCA TGATAGCGGT CCCCGGCGAC TACTACTTAG GGCTCGGGGT GACGAGCGGT
TTCGATCTCG ACGCGCTTCC GGTCTGGCCC CTGGACAAGC TGGCCCTCCG GCTGAAGGAG
CATTGCTGA
 
Protein sequence
MLVLALESSC DETAAAVVKD GRTVLSSIVA SQISVHAEYG GVVPEIASRK HLESVSFVVE 
QALAEAGVGL DRIDGIAVTQ GPGLAGALLV GISVAKGLAF GRSLPLVGVN HIEGHLLAVF
LEAPVQFPFI ALAVSGGHSH LYRVDGIGRY QTLGQTVDDA AGEAFDKVAK LIGLPYPGGV
AIDRLAVSGD PKAIKFPRPL LHDGTFNFSF SGLKTAVLTH VGKHPEAKEA GINDLAASFQ
AAVCEVLTKK TAAAVAATGI KRLVVAGGVA CNSALRRSMA EYAAANGVEL SIPSPALCAD
NAAMIAVPGD YYLGLGVTSG FDLDALPVWP LDKLALRLKE HC
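
As an illustration of how the gene sequence relates to the protein sequence, the following minimal sketch translates the first and last 10-base blocks of the gene sequence. It is not the database's own method. It assumes the standard codon assignments, which translation table 11 shares with the standard code for internal codons (table 11 differs only in which start codons it permits). The helper name translate is hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored so that 10-base blocks can be pasted directly.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base blocks and the final block of the gene sequence above.
first_blocks = "ATGCTAGTTC TTGCTTTGGA ATCATCCTGC"
last_block = "CATTGCTGA"

print(translate(first_blocks))  # MLVLALESSC
print(translate(last_block))    # HC
```

The two outputs match the first ten and the last two residues of the protein sequence listed above. Pasting the whole gene sequence into translate should reproduce the full 342-residue protein under the same assumptions.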