Gene GM21_4099 details


Gene Information

Locus tag: GM21_4099
Symbol:
ID: 8139473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4680468
End bp: 4682807
Gene Length: 2340 bp
Protein Length: 779 aa
Translation table: 11
GC content: 64%
IMG OID: 644871714
Product: multicopper oxidase type 2
Protein accession: YP_003023872
Protein GI: 253702683
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2132] Putative multicopper oxidases
TIGRFAM ID:
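
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the protein length counts every codon except a single terminal stop codon. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4680468
END_BP = 4682807
GENE_LENGTH_BP = 2340
PROTEIN_LENGTH_AA = 779


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2340
print(expected_protein_length(GENE_LENGTH_BP))  # 779
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.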


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 125
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGG TATGGTTAGC CGGTGCCGTT GGCACAGTCG CACTCATCGC CTCGTGTTTG 
ACAACAGCGT GGGCCGGAGT TCACAACCAG GTGCAGTTGC CTGGTTCGGC GATACCGCAG
TTCGTCGATC CGTTGCCTGA CTTGGATCTT GTTCCCGCCG GCGCGGGGCA GATCGAACTG
CGCATGAAGG AATTCAAGAG CATGATCCTG CCGGCCAACG CCGTCCCCGG ATACACCGGG
ACCTGGGTCT GGGGGTATCT GCAGCCCGGA CAGGTGGCGA ACCAGCTGCC GGACGGATCG
ATTCCCGGGC GCCACTCCTA CATCGGCCCG GTCATTGCCG CCACCCGCGG CGTCCCCACG
GAAATGAAGT TCGTCAACGA GCTTCCCACG GTGAACGCCA CCAACGTGCT TGCCTACAAG
TACGGCACGG ACCAGACCAT GCACTGGGCC GATCCTTTGA ACATGGGTAT GAACATGTGT
TCGCACATGT CGATGCCGTC AGCTTTCGGG AGTGAATGCT CCTTCAACTA CGGCGAGTTC
GCCGCCGGGC AGTTCGGTGA TGTGCAGATC CCGGCAGCCG TGCACCTGCA TGGGGGTGAA
GTGCCGCCGC AGCTCGATGG CGGCCCCGAA TCCTGGTTCA CCAACGACGG CCCGATGGGT
GTCCTCAAAG GCGGTTCCTA CTACAGCAGG GACGGTGCCG CTGCAACCAA TTACGCCATC
TACCGCTATC CCAATACCCA GGAAGCTGCT CCCATCTGGT TTCACGACCA CACCCTGGGC
GCCACCCGCC TGAACGTCTA CGCGGGGCTG GCCGGGGCCT ACCTGCTGTC GGACCCCGCC
AAGGATCCGG CGAACCTCCC TGAACTCGTG CCGCTGGTGC TCCAGGACCG CATGTTCGAT
ACCACCGGCC AGCTCTTCTT CACCGCCGGG AACGCAGGAG GGCTCCTCTG GGCGCTCAAC
CCCGAGCACC CTTACTGGAA CCCGGAATTT CTTGGCGACG TCATGGTGGT GAACGGCAAG
TCCTGGCCCT ATAAGAACGT CGAGGCCAAG CGCTACACCT TCCTGTTCCT CAACGGGTCC
AATGCGCGGA CCTACGAAAT GAACCTGACC GACCCGGTCT CCAAGAACCC GGGCCCGGCG
CTGTGGGTCA TCGGCACCGA CGGCGGTTAT CTCGATGCGC CGGTGCAGCC GGCCAAAGGG
AAACTGGTTA TCATGCCGGG CGAACGCTAC CAGGTCATCG TCGACTTCGC CGGATACCAG
GCGGGGCAGG TCGGCCCCAA CGGCCTTCCC TATTCCGGAA AGTGGCTGCT CAAGAACACC
GCCAAGGCAC CGTACCCGGG GGGCGCAGCC CCCAGCGGCA ACATTGAGGG AAGGATCATG
CAATTCGTGG TCGGCGCAGT ACCGGCAACC GATGCTTCCT TCAACCCCGC GCTCCCCGGC
GCGACCCTGC GCCAGTCGAT GGTGCGGCTG GTCAACCCGG CAACCGGCAC CGTCGCCGTC
CCGGTCCAAA AGACGCGCCA GTTGACCCTG AACGAGGTCA TGGGGATGCC GCAATTGGCT
ACCGATCCCA TCACCGGGTT GCCCGTTGCC TATCCCGGCG GCCCGCTTGA GGTCCTGGTC
AACAACACCA AGTGGAACGG CAAGCGGATC AACGGGGTGG ACGCGGCAAC CGGGGACTAT
ACCTTCGAGC CCATTCCGGG GCTGACGCTC GATGCCAAAG GGACCAACTA TATCTCCGAG
CGCCCCAACG AAGGCGAGAC CGAGGTCTGG GAGATCGTGA ACCTGACCGC GGACGCTCAC
CCGATTCACC TGCACCTGGT CCAGTTCCAG TTGGTCGACA GGCAGAACTT CGACGCCAAG
GGGTACACCG CCGCATACAA CCTGGCATTC CCGGGGGGCG GTTTCGATCC GATGACCTTG
AAGCCGTACC CTGCCGGGGT CTACATCCCG GGATTCGGGC CGCCGCTAAG CTACGACAGC
GATCCCGCCA ACGCACGGGC CGTGGGGGGC AACCCCGACA TCGCGGCGCT CGTGAAAGGG
AAGCCGGTCT ACCTGCAGGG GCCGGCGGCG CCGGCGCCGC CGCATGAAGC CGGTTGGAAG
GACACCGTCA TGGCGATGCC GGGCCAGGTC ACCCGGATCG CGGTACGCTG GGCCCCGACC
GATCTGGCCG CCGCCACTCA GGCCGCCTCC GCCTTCTTCC CCTTCGATCC CAACGGCGGC
GACGGCTACG TCTGGCATTG CCACATCATC GACCACGAGG ATAACGAGAT GATGAGGCCG
GACGAGGTGA CCCCGAACCC GAATGCGCTC AGGAGCTATG TGAAAGGGGT CGACTTCTAA
 
Protein sequence
MKKVWLAGAV GTVALIASCL TTAWAGVHNQ VQLPGSAIPQ FVDPLPDLDL VPAGAGQIEL 
RMKEFKSMIL PANAVPGYTG TWVWGYLQPG QVANQLPDGS IPGRHSYIGP VIAATRGVPT
EMKFVNELPT VNATNVLAYK YGTDQTMHWA DPLNMGMNMC SHMSMPSAFG SECSFNYGEF
AAGQFGDVQI PAAVHLHGGE VPPQLDGGPE SWFTNDGPMG VLKGGSYYSR DGAAATNYAI
YRYPNTQEAA PIWFHDHTLG ATRLNVYAGL AGAYLLSDPA KDPANLPELV PLVLQDRMFD
TTGQLFFTAG NAGGLLWALN PEHPYWNPEF LGDVMVVNGK SWPYKNVEAK RYTFLFLNGS
NARTYEMNLT DPVSKNPGPA LWVIGTDGGY LDAPVQPAKG KLVIMPGERY QVIVDFAGYQ
AGQVGPNGLP YSGKWLLKNT AKAPYPGGAA PSGNIEGRIM QFVVGAVPAT DASFNPALPG
ATLRQSMVRL VNPATGTVAV PVQKTRQLTL NEVMGMPQLA TDPITGLPVA YPGGPLEVLV
NNTKWNGKRI NGVDAATGDY TFEPIPGLTL DAKGTNYISE RPNEGETEVW EIVNLTADAH
PIHLHLVQFQ LVDRQNFDAK GYTAAYNLAF PGGGFDPMTL KPYPAGVYIP GFGPPLSYDS
DPANARAVGG NPDIAALVKG KPVYLQGPAA PAPPHEAGWK DTVMAMPGQV TRIAVRWAPT
DLAAATQAAS AFFPFDPNGG DGYVWHCHII DHEDNEMMRP DEVTPNPNAL RSYVKGVDF
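
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last 60-base lines of the gene sequence. This is a minimal sketch under stated assumptions, not the database's own procedure: the codon table is written out by hand, the `translate` helper is hypothetical, and it ignores the alternative start codons that translation table 11 permits (this gene starts with ATG, so that does not matter here). The sense-codon assignments of table 11 are the same as in the standard genetic code.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
first_line = "ATGAAAAAGG TATGGTTAGC CGGTGCCGTT GGCACAGTCG CACTCATCGC CTCGTGTTTG"
last_line = "GACGAGGTGA CCCCGAACCC GAATGCGCTC AGGAGCTATG TGAAAGGGGT CGACTTCTAA"

print(translate(first_line))  # MKKVWLAGAVGTVALIASCL
print(translate(last_line))   # DEVTPNPNALRSYVKGVDF
```

Both outputs match the corresponding ends of the protein sequence listed above. The last line can be translated on its own because every preceding line holds 60 bases, a multiple of three, so the reading frame is preserved.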