Gene GM21_2940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2940 
Symbol 
ID8138283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3419292 
End bp3420383 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content63% 
IMG OID644870538 
Product2-nitropropane dioxygenase NPD 
Protein accessionYP_003022727 
Protein GI253701538 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones120 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAAC CGTTACGCAT AGGAAAACAC GAGGCACGCT ACCCGCTCAT CCAGGGCGGC 
ATGGGGGTGC GCATTTCTGC AGGATCTCTC GCAGGGCACG TAGCGAAGTG CGGAGGGGTC
GGCCTTGTCG CTTCACCCGG CATCGTCTTG AACAGCGAGT TTTTCAACGG TTCGAACTAT
CTCAAGAACA GCTCCCTCGC TCTCAAGGAG GAGCTGCGCA AGGCCTACGA GATCGCGCCC
GACGGCATCG TCGGCGTGAA CGTGATGGTC GCCCTCACCG ATTTCGAGGA ACTGGTCGTC
GCCGCCGTCG AGGGTGGCGC CAAGGTGCTC GTCTGCGGGG CGGGACTCCC CTTGACCTTG
CCGGGACTGA CCGCGCACGC TCCCGACGTG GCGCTGGTGC CGATCGTTTC CTCCGTGCGC
GCGGCGCAAC TGATCGCCAA AAAATGGGAC AAGTCCTACA ACCGTCTCCC CGACGCCGTG
GTGGTAGAGG ATCCCGACAC CGCCGGGGGG CATCTGGGCG AAAAGATAGA AAATATCGGC
AACGGCGACT ATGACCAGTA CGAGACCGTG CGCGGCGTCA AGGAATTCTT CCGTACCGAG
TACAACCTCG ACATCCCCAT CATCGCCGCC GGCGGGATCT GGGACCGCGC CGACGTGCTG
CACGCCCTTG CCGAAGGGGC GGACGGTGTG CAGATGGCGA GCCGTTTCGT AACCACCGTG
GAGTGCGACG CGGACGACGC CTTCAAGCAG GCCTACCTGG ACTGCAAGAA GGAGGACATC
GGTCTCATCA TGAGCCCGGC GGGTCTTCCG GGGCGCGCCA TTCTCACCAA CCAGCAGGGG
ATCGTCGACT ACGACCGGGA TCGTGCCTCC TCCTGCAGCC ACGGCTGCCT GAAAAAGTGC
TCCTACAAGG AAAGCGGAGA GCGTTTCTGC ATCGTCAGGT CCCTGGACCG GGCGCAGCGC
GGCGAGGTTG ACAGCGGCCT GATCTTCTGC GGCACCAACG CCTATAAGGC CAACCGTATC
GAGACCGTCC AGGAGATCTT CGACGAGCTC TTCGCCGAAA CGGGCGCCGT CTCCCACGAG
AAAGCCGCGT AA
 
Protein sequence
MFKPLRIGKH EARYPLIQGG MGVRISAGSL AGHVAKCGGV GLVASPGIVL NSEFFNGSNY 
LKNSSLALKE ELRKAYEIAP DGIVGVNVMV ALTDFEELVV AAVEGGAKVL VCGAGLPLTL
PGLTAHAPDV ALVPIVSSVR AAQLIAKKWD KSYNRLPDAV VVEDPDTAGG HLGEKIENIG
NGDYDQYETV RGVKEFFRTE YNLDIPIIAA GGIWDRADVL HALAEGADGV QMASRFVTTV
ECDADDAFKQ AYLDCKKEDI GLIMSPAGLP GRAILTNQQG IVDYDRDRAS SCSHGCLKKC
SYKESGERFC IVRSLDRAQR GEVDSGLIFC GTNAYKANRI ETVQEIFDEL FAETGAVSHE
KAA