Gene Mmar10_0073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0073 
Symbol 
ID4283939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp71128 
End bp72828 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content64% 
IMG OID638139536 
Producthypothetical protein 
Protein accessionYP_755307 
Protein GI114568627 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGTG ACGTACATTA CGAGGTCTTC TTCAAGAAGA ACCGCAAGGC GAGCTGGGCC 
CTCCACGAGG CCCGCGATGA CCGCGACCAG GCCATCCGCC TCGCCCATTC GCTCGTCGCC
AAGCAAAAGG ACGCCTCGGT CCGCGTGACC AAGGAGACCT TCGACCAGGA ACACCGCAAA
TTCCGCTCCG TGCCCGTATT CGAACGCGGT GCCGAGATGA TGGGGGCTGA AAAAGAAAAG
ACCGGCGAGG CCCGTCTGCC CTGCCTGACC CCGGACGATC TGGCCAAGCC GCATGCGCGC
GACACGATCC GCCGGGTGCT GACCGGCTGG CTGGAGCGCG TTCAGGCCAT CCCGATGGAA
TTGCTGCACC GGCCCGATCT GGTCGAAAGC CTCGAAGCCT CCGGAACCGA ATTGCAGCAC
GCGGTCCAGA AAGTCGCGAT TGCTTCGGCC AGTGACAGCG ATGCCGGCGT GCACGGCTAT
GTCAAACAGC TCAACGAGCT GGTCCAGAAG TCGCTGGCCC GAATTTACAA GGACGGCCGT
GACAATCGCC TGCCGGAATA TCCCAAAAAG GCGGACTTCG CCGAGATTGC CGGTGAGATC
CACAAGCGGG ACCGGCGCGC CTATTCACTG CGCGCGGCCA TGGCCGACCG CCTGCGCCAC
GAAAAGAAAT ATGGCGACAA GCTCGAAGCC CTGCTGGACA TGGGCGACAA TCTGCCTGCC
GACGAAGACG CCCGCAGCTT CGCCCTGGAC GAAGTCGACA GCTATATTGC CGAAGTCATC
GCCTTTGATG CCGGGCGCGA GGCCCTGTTG GGCAAGTGCA AGGATCTTGG CGAAACCCTC
GAGCGGCTGG CCTGCCTGTT CGATGGCGAC CACTCGGCCG ATGCATTGAA CCTCGCTCCC
AGTGCCGCCA AACGGCTGGC CCGCAAGATC AAGGGCAAGG AGTTTCCAGC CTGTCGCGCG
ACCATCGCCG GCTGCATCCT GAAAGACCTC GAACGCCCCA AACGCCTGCG CCCGAGCAGC
GTCCGCGATG AAGTCCGCCT GGCCCGTGAC CTCGCCAGCC GCCTGGTCAT CTGCGCCGAC
AGCACCCTGC CCGCCGACGC GCTGATCAAG GCCTTCGCCT CACGCTCGGC GCGACTGCTG
CAGCCCGAGA TCATCGATGA ATTGCTGCGT CATTCGCGCG GTGCAGACGA GGAACTCGAC
CGGCTGATCG CCCTCGAGGA AAACCTCGTC GGCGAGAGCA ACAAGCAAAA ACTGGCCGGC
TATATCCGCT CGACACTGGG CTCCAACCAG GCCGATGCCT GGTATGTGCG CGGTGATGCC
AAGCCGCTGG AACGCCTGGC CAAGCTGACA TCGCAGCAGG CAAAAGTGCT CAAGGGCGGA
TACCCCGAGC GCGACAAGCT CGAGCTGGCT GCCAGTTTTG ACGCCATGGG CATGAAGGTC
GTCGACGACA GCAAGATCCT CAACATGGTC GAGGGCGGCG ACCGTCCGGC GCTCGACAAG
GCGACCGGGC TGTTGCGCCT GGCGACCGGC GGCGCGCTGC CGATCGGCAA GTGTTCGGCA
GACGCCCAGG CGCGCGCCTT GCGTCATTTG AAGTCGGCTG TCGGTCTAAG CGAGGCCCAG
GCCGAGGACG GTCGACCCAA GCTGCGACAA ATCCAGGGCA TGCTCCAGGA ATTGACGATC
CTGCAGACAA AGTCCGCCTG A
 
Protein sequence
MAGDVHYEVF FKKNRKASWA LHEARDDRDQ AIRLAHSLVA KQKDASVRVT KETFDQEHRK 
FRSVPVFERG AEMMGAEKEK TGEARLPCLT PDDLAKPHAR DTIRRVLTGW LERVQAIPME
LLHRPDLVES LEASGTELQH AVQKVAIASA SDSDAGVHGY VKQLNELVQK SLARIYKDGR
DNRLPEYPKK ADFAEIAGEI HKRDRRAYSL RAAMADRLRH EKKYGDKLEA LLDMGDNLPA
DEDARSFALD EVDSYIAEVI AFDAGREALL GKCKDLGETL ERLACLFDGD HSADALNLAP
SAAKRLARKI KGKEFPACRA TIAGCILKDL ERPKRLRPSS VRDEVRLARD LASRLVICAD
STLPADALIK AFASRSARLL QPEIIDELLR HSRGADEELD RLIALEENLV GESNKQKLAG
YIRSTLGSNQ ADAWYVRGDA KPLERLAKLT SQQAKVLKGG YPERDKLELA ASFDAMGMKV
VDDSKILNMV EGGDRPALDK ATGLLRLATG GALPIGKCSA DAQARALRHL KSAVGLSEAQ
AEDGRPKLRQ IQGMLQELTI LQTKSA