Gene Mmar10_0201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0201 
Symbol 
ID4284016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp213833 
End bp215746 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content56% 
IMG OID638139667 
ProductCopA family copper resistance protein 
Protein accessionYP_755435 
Protein GI114568755 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2132] Putative multicopper oxidases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01480] copper-resistance protein, CopA family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.978891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATGA TATCTCGGCG ATTGTTTTTG GGCGCGTCTG CAGCGGTAGG CGGCGCCGCG 
ACGACCGGCC TACTACCAGC TTGGGCACAA TCCGCTGGCG CTGCAAATCT TAATGGCTTG
AGAGCGCTGT CCGGTCAAAT TTTCGACCTC ACGATTGGTC ACGCGAATGC ACTGATTGAC
GGGCAGGCGG GAGATGCGAT CACAGTCAAT GATCAATTGC CCGCCCCGTT ACTTCGGTGG
CGTGAAGGCG ATGAGATCAC TTTGCGTGTT CATAATACGC TCAGTGAAGA CACATCTATC
CATTGGCACG GACTGTTATT GCCGTTTCAG ATGGACGGTG TGCCCGGTGT GACGTTTCCG
GGTATCCGCC CAGGTGAGAC ATTCACCTAT CGCTTTCCGA TCCGACAGGC GGGCACCTAC
TGGTATCACA GCCATTCTGG ACTTCAGGAA CAAATGGGCC ATTACGGGCC AATTATCATT
GAACCGGCTC GACCGGATCC AGTGGCCTAT GACCGCGAAT ATGTTGTCGT CCTCTCCGAC
TATACGTTCG AGGGGCCTCA TCGTGTTTTC GAGAAGCTCA TGAAAATGAG CGACACTTAC
AACTTCCAAC AACGCACACT GTCAGATTTC GTTGAGTCGT CCCGTCAAAA TGGACTGCTT
TCCGCGCTCC GCGATCGCAC CATGTGGGGT CAGATGCGCA TGAGTCCGAC CGACATATCC
GACGTCACGG CCGCCACACT TGAATATCTC GTCAATGGTC ATGGTCCCGC GGACAATTGG
ACGGGTCTCT TTACACCTGG AGAGCGCGTG CGATTGCGCT TCATCAACGC TTCGGCGATG
ACGATCTTCA ACGTGCGCTT CCCAGATCTG CCGATGACAA TCGTTCAAGC GGACGGCCTG
AATGTTCAAC CTGTTGAGAT AGACGAATTC CAGATTGGAG TCGCTGAAAC TTACGATGTC
ATCGTGCAGC CGCAGGACGA TCGTGCCTAC ACGCTGATGT GCGAGTCGAT CGATCGTAGC
GGGTTCGCCC GCGCAACTTT GGCACCGCGC ACCGGTATGG TGGCAGCAGT ACCGCCGCTT
CGGCCGCGAC CAACCCTGAC GATGCAAGAC ATGGGCATGG ACCACGGCGC TATGGGACAT
GAAGGTATGG ACCACGGTAG GCCTTCGTCG CCGAGCGCCG ATCATGCTGC GATGGGTCAC
ACACCATCGT CAGGTTCACC GAATACCAGC GGCCACTCTG GCCACATGGA TCATGCGTCG
ATGGGTCATG ACGCTCAACC GGTTAATGAG ACCCACATGT CACCGATGCC AATAAGTGGA
CAGCAACGCC ACGACCACCC GCGGGGCCCG GGCGTTGCGA ATGTCGCGAT GCAACCGACA
TCAAGGCTTG GTGAACCGGG CGCGGGGCTT ACCGATGTTG ATCATCGGGT GCTGGTTTAC
ACCGATCTAA AAAGTCTTGA ACGAAATCCG GATACCCGTC CGCCGGGGCG AGAAGTCGAG
GTTCATCTCA CTTCCAATAT GGAACGCTAT ATGTGGTCGT TTGACGGGCG CCGCTGGAGT
GAAGTGGTCG ATCCCATCCA GTTTTACCAA GGCGAACGCG TACGCTTGAC GATGGTCAAC
GACACTATGA TGCCGCATCC GATCCATCTT CACGGTATGT TTTTCGATGT CGTAAACGGC
GAAAGCGCCC ACAAACCGCG TAAACACACC ATCACCGTTA AGCCCGGTGA AAAACTCTCT
GTGGACGTCA CCCCCGAAGA CGTCGGTGAT TGGGCCTTCC ACTGTCATCT TCTCTACCAC
ATGCACGCTG GCATGTTCCA AGTGGTTTCA GTCCTCCCAG AAGAAACGGG ACATTCAATG
CATGATCCTC AAAACGGCCA GCACCAAGGT CATAACCCGC ATGGAGATCA TTGA
 
Protein sequence
MRMISRRLFL GASAAVGGAA TTGLLPAWAQ SAGAANLNGL RALSGQIFDL TIGHANALID 
GQAGDAITVN DQLPAPLLRW REGDEITLRV HNTLSEDTSI HWHGLLLPFQ MDGVPGVTFP
GIRPGETFTY RFPIRQAGTY WYHSHSGLQE QMGHYGPIII EPARPDPVAY DREYVVVLSD
YTFEGPHRVF EKLMKMSDTY NFQQRTLSDF VESSRQNGLL SALRDRTMWG QMRMSPTDIS
DVTAATLEYL VNGHGPADNW TGLFTPGERV RLRFINASAM TIFNVRFPDL PMTIVQADGL
NVQPVEIDEF QIGVAETYDV IVQPQDDRAY TLMCESIDRS GFARATLAPR TGMVAAVPPL
RPRPTLTMQD MGMDHGAMGH EGMDHGRPSS PSADHAAMGH TPSSGSPNTS GHSGHMDHAS
MGHDAQPVNE THMSPMPISG QQRHDHPRGP GVANVAMQPT SRLGEPGAGL TDVDHRVLVY
TDLKSLERNP DTRPPGREVE VHLTSNMERY MWSFDGRRWS EVVDPIQFYQ GERVRLTMVN
DTMMPHPIHL HGMFFDVVNG ESAHKPRKHT ITVKPGEKLS VDVTPEDVGD WAFHCHLLYH
MHAGMFQVVS VLPEETGHSM HDPQNGQHQG HNPHGDH