Gene Mnod_6003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_6003 
Symbol 
ID7305363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp6108129 
End bp6109259 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content77% 
IMG OID643603626 
Productprotein of unknown function DUF201 
Protein accessionYP_002501133 
Protein GI220925831 
COG category[R] General function prediction only 
COG ID[COG2232] Predicted ATP-dependent carboligase related to biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.866607 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGCG ACGGCGATGC GATCCTGATC GCCGCGCAAT CCGGGCGGGC CCTCGCGGCG 
GCGGCGCGGC GGGCGGGCCT GCGCCCCTTC GTGGCCGACC TGTTCGGGGA CGAGGACATG
CGGGCGCTCG CCGCCGGCTA CCGGGCGCTG CCGGGCCGCT TCGGCGCCGG ACCGGCCGCG
CGGGGCGTGA TCGCGGCCCT CGATGCGCTC GCCGCCGAGG CGGGCACGCC TCTCGGGGTG
GTGCTCGGCA GCGGCTTCGA GAGGGCACCC GCCCTGATGC GGGCCATCGC GGCGCGCCAC
CGCCTGATCG GCGCCGCCCC CGCCACCGTC GCGGCCCTCA AGGACCCGGC GAACCTCGCC
GCCCTGTGCG CGCGCCTCGG CATCCCGCAC CCGCCGCTCA GCCTCGAGGC CGTGCCCGAT
CCGGAGAACT GGCTGCTCAA GCGCCGCGGC GCCTCGGGGG GCGGCCATAT CCGGCCGGCG
GGGCCGGGCC CCGTGGCCCG GGGCGCCTAT CTGCAGCGCC GGATGCCCGG CACCGCACGG
TCCCTGACAA TCCTCGCCGA CGGGCGCCGG ATCCTCGTTA TCGCCGACAC GGCCCAATGG
ACCGCCCCGA GCCCGGCGCG GCCCTTCCGC TACGCCGGCG CCGTCGAGCC CGGCGGCATG
CCGCCGGGCG TGCGGGAGGC CGCGACGGCG GCCGTCGCCG CCCTGGTGGA GGAAACGGGG
CTCTGCGGCC TCGCCAGCGC CGATTTCCTG GTCGACGGCA CCGACTGGTG GCTCCTCGAG
ATCAATCCGC GTCCCGGCGC CACCCTGGAC GTGCTCGACC GCCGCACCGA ACCGCTCCTC
GCCCGCCACA TCGACGCAGC CGGCGGCCGG CTCGGCGCCA CCCTGGCCCT TCCTCCAGAT
GCGGTCGCCA CGCAGATCTG CTACGCCGTT GAGCGCATTC CGCTAGTGCC GCCCCTGGCT
TGGCCGGACT TCGTGATGGA CCGGCCGCTT GCCGGCAGCC GGATCCCCGC CGGGGCGCCG
ATCTGCACGG TGCGGGCCTC GGGGCCCGAC GCGCAGGCCG CCCGAAGCGA GGTCCGAGCC
CGAGCCGAGG CCGTGCGGGC CTTAATTCAC CGCGAGGGAG ACCGTCCATG A
 
Protein sequence
MARDGDAILI AAQSGRALAA AARRAGLRPF VADLFGDEDM RALAAGYRAL PGRFGAGPAA 
RGVIAALDAL AAEAGTPLGV VLGSGFERAP ALMRAIAARH RLIGAAPATV AALKDPANLA
ALCARLGIPH PPLSLEAVPD PENWLLKRRG ASGGGHIRPA GPGPVARGAY LQRRMPGTAR
SLTILADGRR ILVIADTAQW TAPSPARPFR YAGAVEPGGM PPGVREAATA AVAALVEETG
LCGLASADFL VDGTDWWLLE INPRPGATLD VLDRRTEPLL ARHIDAAGGR LGATLALPPD
AVATQICYAV ERIPLVPPLA WPDFVMDRPL AGSRIPAGAP ICTVRASGPD AQAARSEVRA
RAEAVRALIH REGDRP