Gene Mmar10_1833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1833 
Symbol 
ID4286385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp2000542 
End bp2001972 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content59% 
IMG OID638141331 
Producthypothetical protein 
Protein accessionYP_757063 
Protein GI114570383 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.424586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGG TTGACTTGGT ATTCCGTCAG ATCGACGAGC GCACGCGTGA TGTCGCGCTC 
GAGCTGGCAA TCGAGAACAT CCAGCCGGAT GCGGTGCATG TCATCGATGA CGTCCGGCCG
TTTACCGAGT GTGTTGCGCA GATGCTGCGG ATCGAGCATG ATTGCGACTA TGTGGTCTAT
GTCGATGCCG ACTGCCTGAT CCTCGAAAAT ATCCGCCCCT TTATCGACAA TTGCGATGCG
CCCTATGTCG ACGCCTATGT CTCGGACCGT TTCCGCGGTC GCTTGCATTG TGGTGTCCAT
ATCACCCGCA TCGACCTGGT ACGCGCCATG GCAGCGGTCG AGGTGCCGGA GGATGATCTG
AAATACGTGC TGCGGCCGGA GTCGCGCCTG CGCAACCTGG CCATGAAGCC GATGGGCTCG
GCCAAGCAGT TCCGCAATTT CGACATCCTC CATGACCATT TCCAGTCCTT CCACCACATC
TTCATGAAAT ACGCGCTGCG CGAATTGCGC AGCCGGACCA AGATCCAGAG GCAACGTCTG
ACCCGGTCTA TGAAGAAATG GGTGACCAAT GACGGGGAGT TCTGCGACCT GACCATGGCA
CGCCATGCGG TCGAGTATGC GCGGCGGATG GTTCCGGACA GTGCCACCCC TGACGAGGTC
CACGCCTTCA TCACGGCACT GCCCGAGCAC GCCGAATCCG AGCTGGCCCA GCTGGGAATC
GGCGCTGAAG CATCGTTCAC GCGGGATGAC CTGACAGCCT GGCTCGAGGC CAATTCAGAT
CGGGTTGCCT ATGGCACCAA GGCCGACAAG CCGAAAGTAT TCGGGCTGGG CCTGTCACGG
ACAGCGACGC GGTCCCTGAC TGCCGGCCTG CAGATGCTGG GCTTCGATTG CAGCCATTAT
CCGATAGACG AGGACACCTA TATCGAGATT GCCAACGCCC AGTATGACCT GACCCTGCTG
CGCTATTATG ATGGCCTGAC CGATATCACC ACGATCCCGA TCTATCAGCA GCTCGACAAG
CAGTATCCCG GCTCGAAATT CATCCTGACG GTGCGCGACA AGGAAAGTTG GCTGGGTTCG
GTGTCCCGGC ACTATTACAA CCGGCCGGCT TTCAAGGACG TCAACGACCC GGACGAGGAA
GTCCACCTGC GGATGCGCCA ATTCCTGCGG GCGACCGTTT ACGGGTGCTA TAATTATTGC
CCCGAGCGTT TTTCCTGGGT CTATGACCAA CACATCCGCA GTGTGATGGA CTTCTTCAAG
GACCGGCCCG ACGACCTGTT GGTCCTCGAC ATCTGTTCCG GTGAGGGGTT CGAGAAACTG
GCGCCCTTCC TTGATCGGCC GATCCCGGCA GAAGCTTTCC CGCACAAGGG GGCCGTTCTG
TCACGCCGGA TAGCCGAGGA AGCCGCGGCT CAAAAGGCGC GGGTCGCCTG A
 
Protein sequence
MAKVDLVFRQ IDERTRDVAL ELAIENIQPD AVHVIDDVRP FTECVAQMLR IEHDCDYVVY 
VDADCLILEN IRPFIDNCDA PYVDAYVSDR FRGRLHCGVH ITRIDLVRAM AAVEVPEDDL
KYVLRPESRL RNLAMKPMGS AKQFRNFDIL HDHFQSFHHI FMKYALRELR SRTKIQRQRL
TRSMKKWVTN DGEFCDLTMA RHAVEYARRM VPDSATPDEV HAFITALPEH AESELAQLGI
GAEASFTRDD LTAWLEANSD RVAYGTKADK PKVFGLGLSR TATRSLTAGL QMLGFDCSHY
PIDEDTYIEI ANAQYDLTLL RYYDGLTDIT TIPIYQQLDK QYPGSKFILT VRDKESWLGS
VSRHYYNRPA FKDVNDPDEE VHLRMRQFLR ATVYGCYNYC PERFSWVYDQ HIRSVMDFFK
DRPDDLLVLD ICSGEGFEKL APFLDRPIPA EAFPHKGAVL SRRIAEEAAA QKARVA