Gene GM21_3013 details


Gene Information

Locus tag: GM21_3013
Symbol:
ID: 8138359
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3501095
End bp: 3502123
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 63%
IMG OID: 644870614
Product: cytochrome d ubiquinol oxidase, subunit II
Protein accession: YP_003022800
Protein GI: 253701611
COG category: [C] Energy production and conversion
COG ID: [COG1294] Cytochrome bd-type quinol oxidase, subunit 2
TIGRFAM ID: [TIGR00203] cytochrome d oxidase, subunit II (cydB)
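
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3501095
end_bp = 3502123
gene_length_bp = 1029
protein_length_aa = 342

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # prints: 1029 0 342
```

Under these assumptions the span matches the listed gene length, the length is a whole number of codons, and the codon count minus the stop codon matches the listed protein length.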


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 100
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCTGC ATATCGTTTG GTTTGTGCTT TGGGGTGTTC TGTGGGGGGT CTATTTCATG 
CTGGACGGGT TCGTGCTGGG AGGGGGCATG CTGCACCGCG TGCTCGGCAG AGACGATACC
GACCGGCGCG TGCTCATCAA CGGCTACGGC CCTGTCTGGG ACGGCAACGA GGTCTGGCTG
GTCACCGCCG GGGGCGCCAC CTTCGCCGCC TTCCCGACCA CCTATGCGCT CATGTTCAGC
TACCTGTACA CCCCGTTGTT GCTGCTTCTT TTCGCGCTCA TCGTGCGCGG GGTCTCCTTC
GAGTTCCGCG GCAAGGAAGA CGGCGCGCTC TGGAAGGGGT GCTGGGACTG GGCCATAGTC
ATCTCCAGCT TCATCCCGGC GCTTCTTTTT GGTGTCGCCT TCGGCAACAT CTTCGCCGGT
CTCCCCATGG ACGAGGCGGG CTACCACGGC TCTCTCATCT ACTTGTTGAA TCCCTACGGC
GTGGTGAGCG GCCTTCTCTT CGTATTGCTC TTCCTGGAGC ACGGCGCGCT CTACGCGGCC
CTGAAGAGCA CCGGCGCCCT GAGCCGTCGC GCCGAAGAGA TGGCGAAGGC GCTCTGGATT
CCGCTTCTGG TGGTGGCGGT GGGCTTTTTG GGCTACAGCA ATTTCGCAAC GAAGCTCTAC
GACAACTACC TCGCCGCACC CGTCTTCGCC GTGGTGCCGC TGCTGGCCGT GGCCGCGCTC
TTGGCGGTGC GCCTGTTCCT TGCCAAGGGG AACCCCCTGG CGGCTTTCGC GGCGTCCTGC
GCCACCATAC TGGGGGTTGT TTTCACCGGC GTCATCGGGC TCTTCCCGAA TCTGATCCCC
TCGAACCTCG ACGCGCTTTA CAGCCTCACC ATATACAACA GCTCCTCCTC CGATTACACG
CTGCGCATCA TGACCGTCGT CGCCTTCATC TTCGTGCCGA TCGTGATCGC CTACAAGATC
TGGGTCTACC GGCTCTTCCG GGGGCGGGTC AGCGCCGAGA CCCTGGCCGG GGACCACGAG
GCTTACTAG
 
Protein sequence
MDLHIVWFVL WGVLWGVYFM LDGFVLGGGM LHRVLGRDDT DRRVLINGYG PVWDGNEVWL 
VTAGGATFAA FPTTYALMFS YLYTPLLLLL FALIVRGVSF EFRGKEDGAL WKGCWDWAIV
ISSFIPALLF GVAFGNIFAG LPMDEAGYHG SLIYLLNPYG VVSGLLFVLL FLEHGALYAA
LKSTGALSRR AEEMAKALWI PLLVVAVGFL GYSNFATKLY DNYLAAPVFA VVPLLAVAAL
LAVRLFLAKG NPLAAFAASC ATILGVVFTG VIGLFPNLIP SNLDALYSLT IYNSSSSDYT
LRIMTVVAFI FVPIVIAYKI WVYRLFRGRV SAETLAGDHE AY
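
The sequences above are printed in space-separated blocks of ten residues. The sketch below, which is not part of the original record, shows one way to strip that grouping, compute a GC fraction, and translate the gene sequence for comparison with the protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it allows; that is sufficient here because the gene starts with ATG.

```python
from itertools import product


def clean(seq_block: str) -> str:
    """Drop the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# Standard codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGATCTGC ATATCGTTTG GTTTGTGCTT TGGGGTGTTC TGTGGGGGGT CTATTTCATG"
dna = clean(first_line)
print(len(dna), translate(dna))  # prints: 60 MDLHIVWFVLWGVLWGVYFM
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence through `clean` and `translate` should allow the same comparison for the whole protein, and `gc_fraction` can be compared with the GC content field.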