Gene M446_3267 details

Gene Information

Locus tag: M446_3267
Symbol:
ID: 6132130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 3619865
End bp: 3621583
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 71%
IMG OID: 641643454
Product: respiratory-chain NADH dehydrogenase domain-containing protein
Protein accession: YP_001770106
Protein GI: 170741451
COG category: [C] Energy production and conversion
COG ID: [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
        [COG1905] NADH:ubiquinone oxidoreductase 24 kD subunit
TIGRFAM ID:
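
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of the source database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(3619865, 3621583)
print(length, protein_length(length))
```

With the coordinates of this record, the sketch reproduces the listed values of 1719 bp and 572 aa.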


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00072788
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.294343
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGC ACGATGTGTC GCGCGTCAAG CGCTTCGAAC ACCCCGGACG CGGTCGCGAG 
CGGGCGAGGC CGGTGCCGAA GGGGCGCCAG GTCGATCCCC GGGCGAAGGC CGAGATCGCC
GCGCTCCTCG GCGAGGCGCC GCGCCGCCGC GACCTCCTCA TCGAGCACCT GCACCTCGTC
CAGGACACCT ACGGGCAGAT CAGCGCCCCC CACCTCGCGG CGCTCGCCGA CGAGATGGGG
CTCGCCTTCG CGGAGGTGTT CGAGACCGCG ACCTTCTACG CCCATTTCGA CGTGGTGAAG
GAGGGCGAGG CGGCGATCCC GGCGCTGACC GTGCGGGTCT GCGACAGCCT CACCTGCGCG
ATGCACGGCG CCGAGGAGCT CCTCGCCGCG CTCCAGGCGG AGATCGGCGC GCAGGTGCGC
GTGGTGCGCG CGCCCTGCGT CGGCCTGTGC GACCACGCCC CGGCCGCCGA GGTCGGGCAC
AATTTCCTGC ACCGGGCCAC GGTCGAGACT GTGCGGGCGG CGGTCGCGGC CGGCGACACG
CACGCGCATC TTCCCGACAC GATCGATTTC GACGCCTACC GGGACGCGGG CGGCTACCGG
ACCCTGGAGC GCCTGCGCGC CGGCGATCTC TCGGTCGAGG ACGTGCTCAA GGTCCTCGAC
GACGGCTCGC TGCGGGGCCT CGGCGGGGCG GGCTTCCCGA CCGGTCGCAA GTGGCGCTCG
GTGCGCGGCG AGCCCGGCCC GCGCCTGATG GCGGTGAACG GCGACGAGGG CGAGCCCGGC
ACCTTCAAGG ACCAGCTCTA CCTCAACACC GACCCGCACC GCTTCCTCGA AGGCACCCTG
ATCGGCGCCC ACGTGGTCGA GGCCGAGGAC GTCTACCTCT ACATCCGCGA CGAGTACCCG
ATCGCCCGCC AGATCCTGGC GCGGGAGATC GCCAAGCTGC CGCCGGGCGG GCCGCGCCTG
CACCTGCGGC GCGGGGCCGG AGCCTATATC TGCGGCGAGG AATCCTCGCT GATCGAGTCG
ATCGAGGGCA AGCGCGGCCT GCCGCGCCAC AAGCCCCCGT ACCCGTTCCA GGTCGGCCTG
TTCGGGCGCC CGACGCTGAT CAACAACGTC GAGACGCTGT ACTGGATTCG CGACCTGATC
GAGCGCGGCC CGGATTGGTG GAAGAGCCAC GGCCGCAACG GCCGCACGGG CCTGCGCTCC
TACTCGGTGT CGGGCCGGGT GAGGGAGCCG GGCGTGAAGC TCGCGCCGGC CGGGGTCACC
ATCCGCGAGT TGATCGAGGA ATTCTGCGGC GGCATGGCCG AGGGCCACCG CTTCGCGGCC
TACCTGCCGG GCGGCGCCTC GGGCGGCATC CTGCCGGCCG CGATGGACGA CATCCCGCTC
GATTTCGGGA CCCTCGAGAA GTACGGCTGC TTCATCGGCT CGGCCGCCGT GGTGGTGCTC
TCCGACAAGG ACGACATCCG CGGTGCGGCC CTCAACCTGA TGCGGTTCTT CGAGGACGAG
TCCTGCGGCC AATGCACGCC CTGCCGGGTC GGCACCCAGA AGGCCCGCAT GCTGATGGAG
AGCGGGGTCT GGGACACCGA CCTCCTCGGC GAGTTGTCGC AATGCATGCG CGACGCCTCG
ATCTGCGGCC TCGGCCAAGC GGCCTCGAAT CCGCTCACCA GCGTCATCAA GTATTTCCCG
GACCTCTTCC CGACCCCGCG GCCCATGGCG GCCGAGTAG
 
Protein sequence
MSQHDVSRVK RFEHPGRGRE RARPVPKGRQ VDPRAKAEIA ALLGEAPRRR DLLIEHLHLV 
QDTYGQISAP HLAALADEMG LAFAEVFETA TFYAHFDVVK EGEAAIPALT VRVCDSLTCA
MHGAEELLAA LQAEIGAQVR VVRAPCVGLC DHAPAAEVGH NFLHRATVET VRAAVAAGDT
HAHLPDTIDF DAYRDAGGYR TLERLRAGDL SVEDVLKVLD DGSLRGLGGA GFPTGRKWRS
VRGEPGPRLM AVNGDEGEPG TFKDQLYLNT DPHRFLEGTL IGAHVVEAED VYLYIRDEYP
IARQILAREI AKLPPGGPRL HLRRGAGAYI CGEESSLIES IEGKRGLPRH KPPYPFQVGL
FGRPTLINNV ETLYWIRDLI ERGPDWWKSH GRNGRTGLRS YSVSGRVREP GVKLAPAGVT
IRELIEEFCG GMAEGHRFAA YLPGGASGGI LPAAMDDIPL DFGTLEKYGC FIGSAAVVVL
SDKDDIRGAA LNLMRFFEDE SCGQCTPCRV GTQKARMLME SGVWDTDLLG ELSQCMRDAS
ICGLGQAASN PLTSVIKYFP DLFPTPRPMA AE
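
The GC content and protein sequence in this record can be cross-checked against the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in as a string with the 10-base grouping and line breaks left in place. The helper names are illustrative. The codon-to-amino-acid assignments used here are those of the standard code, which translation table 11 shares; table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Map each codon (in TCAG order) to its one-letter amino acid; '*' marks a stop.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(sequence.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two 10-base groups of the gene sequence above.
print(translate("ATGAGCCAGC ACGATGTGTC"))
```

Applied to the full gene sequence, `translate` should yield the protein sequence listed above once its spaces are removed, and `gc_content` can be compared with the reported GC content.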