Gene GM21_2007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2007 
Symbol 
ID8137341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2327119 
End bp2328444 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content62% 
IMG OID644869620 
Productheat shock protein DnaJ domain protein 
Protein accessionYP_003021817 
Protein GI253700628 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value2.8484999999999997e-22 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAGCTC GTTTTAAGGG AAAACACCCG GAAGAGATCG AACTCGAGCA AAAACGCGCC 
GAGCTGGCGT CGTTGCGCGC CCGGCACGCG GAGCTTACGG CGGAGCTGCA GCAACTGCGC
GAGGAGATCG CCGGTTTCGA GAAAAATTAC CAGGCGACTC TTGGGCTGCG CATGGCGGAA
CTGGAGCGCC TGGAGGCGGA GATAGCGCGC CTCAACGGCG GCGGCGCTGA TAACGACCCG
GAGATCCAGG AGGAGTACCT GGACCGAGAA ACCTACAGCC GCGGCAAATC TTTCAAGTCG
GCTGCCGGCG CCGGATCGCG CGGCAGGGTG TGGAAAGCTG AAGAAGAGGA CATCAAGGCG
CTATACCGGG AGGTCGCCAA GGCGATCCAC CCGGACCTTG CCGGGGAGGG CGCCGGCAGC
TTAAGGCACG AACTGATGTT GAAGGCGAAC CAGGCGTACG CGGATGAGGA TTCCCGTGCT
TTGAGGGAGA TACTGCGCAG TTGGAGAAGG CATCATCCGG AGCGGCAGCC TGAGGCGCCG
GATACCAAGG TGGAACTGGC GCGGGTGAAA AGGGAGATAG CGGTGGAGGC GCAGGCGGTG
CATGACCTGA GCGATCAGGT GGAGCAACTC AGGACGAGCT ACGTCTGCTG CTTTAAACTG
CGGGTCGACC AGAGCCTGGC GGAAGGGAGC GATCTATTCG CCGAGATGAT AGCTGCGGCG
GACATGAATG TCGCCAGGGC GCAGCGCCGG CTGGCTGCCC TGAGAAGCGA GAAGGCGCGG
GAGGCTGAGG GACGCACCAG GGTGAGAAGA AGTATCCAGT TCCCCGAAGG TCTTTCTTGC
GGAACTCTCT ACTTCCGGGA TCTTGCCTCG GCGGACTTCA GCCAGTGGAA GAAGGCGGGG
CCTGCGGTGG GAAGGGTGGA GGTGTACATC GACCAGGCCG TGCGCCTGGA CGTGAAGGAG
CAGGCGGGAC CGGACCTCAA GCTTTTGCAG CAATTGAGAC CAAACGACCT TCAGGCGCTC
TTTCTCTACG AGATGACCGA CGCGAACCTC GACAACATCG TGCACCTGAG CGGCCTGGAG
GAGCTTTACC TCTGTGGCCA GGGGTTGACC GACGCTGCGC TTCTTTGCAT CTCCTCGCTC
ACCAACCTGA AGAGGATCTA CTTGTACCAG ACCGCCATCT CGGACCGGGG GCTCGTTTAC
CTCCAGGGGC TGCAGGGGCT GAAGGGGCTC ACCAGCAGCG GCAACAGCAT CACCGAGGAG
GGGCTCGCGA TATTCCAGAA AGCCATCCCC GGCGTCAAGA CGGTAAGTTT CAAGTGGAGA
CGGTGA
 
Protein sequence
MTARFKGKHP EEIELEQKRA ELASLRARHA ELTAELQQLR EEIAGFEKNY QATLGLRMAE 
LERLEAEIAR LNGGGADNDP EIQEEYLDRE TYSRGKSFKS AAGAGSRGRV WKAEEEDIKA
LYREVAKAIH PDLAGEGAGS LRHELMLKAN QAYADEDSRA LREILRSWRR HHPERQPEAP
DTKVELARVK REIAVEAQAV HDLSDQVEQL RTSYVCCFKL RVDQSLAEGS DLFAEMIAAA
DMNVARAQRR LAALRSEKAR EAEGRTRVRR SIQFPEGLSC GTLYFRDLAS ADFSQWKKAG
PAVGRVEVYI DQAVRLDVKE QAGPDLKLLQ QLRPNDLQAL FLYEMTDANL DNIVHLSGLE
ELYLCGQGLT DAALLCISSL TNLKRIYLYQ TAISDRGLVY LQGLQGLKGL TSSGNSITEE
GLAIFQKAIP GVKTVSFKWR R