Gene GM21_3112 details

Gene Information

Locus tag: GM21_3112
Symbol:
ID: 8138462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3608911
End bp: 3610221
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 64%
IMG OID: 644870716
Product: NHL repeat containing protein
Protein accession: YP_003022898
Protein GI: 253701709
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID:
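
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
start_bp = 3608911
end_bp = 3610221
gene_length_bp = 1311
protein_length_aa = 436


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


print(span_length(start_bp, end_bp))            # 1311
print(expected_protein_length(gene_length_bp))  # 436
```

Both results agree with the Gene Length and Protein Length fields listed in the record.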


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 155
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGCTA ATCGTGGTGA TCGCAAGGTC CGTAGTGTGG TTCGAATGCT GGCGCTGGTG 
CTTTTGTCCG CCTGCATGAC CGCCTGCGCC GGCAAAACGG CAAAGAACAG CGTCTTTTGG
CCCGGCGCGC CCGATCTGCC CCGCATCCAG TTCCTGACCG CATTCAAGGA TTCGAAGGAT
GTCATCGGCG AGAAGAAGCT TTCGCTTCTC GACGTCGGGG GAACGCCGGA CATCTTCATC
AACCTGGTGA AGCCGTACGG GATCACGTCC GCCAACGGGA AGCTGTACAT CTGCGACACG
CTGCAGGCGG ACGTCATCAC CGTCGACCTC CCGAACAAGA AGATGACCCG CCTCTCCGGG
AACGTGAACG CCGGCAGGCT GAAAAAGCCG GTCAACGTGG CCGTCGACGC CAGGGGGAAC
ATCTACGTCG CCGACACCTC GCGGCTCGAA GTGCTGCAGT ACGCCCCGGA CGGCTCCTAC
GTAAGGAGCA TCGGCACCAG CAAGGACCTC AAACCTGTCG ACGTGCGCGT CGACGACCTC
TACCTCTACA TCCTGGACGG GATGACCAGC CAGGTCCACC TCTACGACAT CGCCGGCGGC
GACTACGTCA AGTCGATCGG CAGAAACGAC GACCCCAAGA GAAACCTGGC CGGACCGACC
AACATGGCGC TCGACAGCAA GGGGGGGGTC TACGTCAGCA ATTTCGGCAG CGGCCGGATC
ATCAAGCTGG ATAGGGACGG GAACTTCCAC CTGGGATTCG GCAAGCTCGG CAGCTCCTTC
GCCGATTTCA CCCGCCCGCG CGGGATAACC GTGGACGAAA CCGGCCTGGT GTACGTGGTC
GACGCCGGCG CGCAGCACGT GCAGATCTTC GACGACAGGT TCCGGCTGCT GCTGCTCTTC
GCCGGCCCGG GGACTCCCGG TTCGCTCAAC ATCCCCGCGG GGATCACGGT CTCCACCGAC
AACCTGGACT ACTACCAGAC GCTCGCCGAT CCCGACTTCA AGCTGGAGAA GGTCATCTTC
GTGGTGAGCC AGGTGGGAGA GCACAAGGTG AGCGTCTACG GCCTCGGGAA AAAAGAGGGG
ATCGATTACG CCGCCGAGGA AAAGAAGACC ATGGAGGACG TGAAGAAAAG GGCGGCCGAG
GCGGCGGAGC GGCGGCGCAA GCTGGAGGAG GAGAAGGCGG CGAAGGAGCT CGAAGGTGGC
GAGACGAAAG CCGCGGCTCC CGCGCCGGAG CCGGCGGCAC AGGCTGCGGA CGAGCCGGTC
AACATCCCCT GGGCCGGGAA AGCGGCCGGC GCCACCCCCC CTGCGCGCTA G
 
Protein sequence
MQANRGDRKV RSVVRMLALV LLSACMTACA GKTAKNSVFW PGAPDLPRIQ FLTAFKDSKD 
VIGEKKLSLL DVGGTPDIFI NLVKPYGITS ANGKLYICDT LQADVITVDL PNKKMTRLSG
NVNAGRLKKP VNVAVDARGN IYVADTSRLE VLQYAPDGSY VRSIGTSKDL KPVDVRVDDL
YLYILDGMTS QVHLYDIAGG DYVKSIGRND DPKRNLAGPT NMALDSKGGV YVSNFGSGRI
IKLDRDGNFH LGFGKLGSSF ADFTRPRGIT VDETGLVYVV DAGAQHVQIF DDRFRLLLLF
AGPGTPGSLN IPAGITVSTD NLDYYQTLAD PDFKLEKVIF VVSQVGEHKV SVYGLGKKEG
IDYAAEEKKT MEDVKKRAAE AAERRRKLEE EKAAKELEGG ETKAAAPAPE PAAQAADEPV
NIPWAGKAAG ATPPAR
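
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python. It assumes the sense-codon assignments of translation table 11 match the standard genetic code (the two tables differ in which codons may act as start codons), and the helper names `clean` and `translate` are hypothetical. The demonstration uses only the first 60 bases of the gene sequence shown above.

```python
# Codon table laid out in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the sequence starts with ATG. Alternative start codons,
    which table 11 allows, are not given special treatment here.
    """
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence from the record above.
first_line = "ATGCAGGCTA ATCGTGGTGA TCGCAAGGTC CGTAGTGTGG TTCGAATGCT GGCGCTGGTG"
print(translate(first_line))  # MQANRGDRKVRSVVRMLALV
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` in the same way would give the complete translation for comparison.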