Gene Mmar10_2649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_2649 
Symbol 
ID4285980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp2886289 
End bp2888298 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content62% 
IMG OID638142148 
Productsulfotransferase 
Protein accessionYP_757873 
Protein GI114571193 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGAAG AGACGGACAT CCATGTCCAG GCGCTGACGG CAGCGCAAAC GCAGATGTAT 
GAAGGCCGCT TTGATGCGGC CCGGGAGACG CTGCAACCGG TTCTGGATGC GTCACCGGAC
CATGTCGATG CGCTCTACAT GCAGGCGGTC TGCGCCCGTT ACCTCAAACG CCATGACGAA
GCCCGCGCCG CTCTCGAGCG TATCAAGGGC GTCTCGCCAG ATTTTGGCCG TGCCTATCAG
GAAGAAGGCC ATCTCCTGCG CGCGCTGGGC GAGGATGATC GCGCTCTGAC TGCCTATCAA
CGCGCTTGCC GGTTCAACCC CGCCCTTGTT GCCAGCTGGC GGGCTCAATC AGACCTCTTG
CAGGCGGCAG GGCGTCCGGC GGAGGCCAGC AACGCAGCGG CACAGGCGGA ACGGATTGCC
GCACTGCCGC GGGACCTGGT CTCGGTCACC CATCTCCTGC ACGAGGGCAA GCTCCTGAAG
GCCGAACAGC TGTGTCGGGC CTTTCTCCAG AAGACCCCGC ACCATGTCGA AGCCATGCGA
TTGCTGGCGG AAATCGGTTC GCGCTTCGGC GTGCTTGAAG ATGCTGACTT CCTGCTCGAG
AGCGCCATCG GTTTTGAACC GGACAACACC CAGCTGCGGC TCGACTACAT CCAGATCCTG
CGCAAGCGCC AGAAGTTCGC GGCCGCCCTC GAACAGGCCC GACAGCTCTG GGAGACCGAT
CGCGATAATC CGGTCTTCAA ATCCCACTAC GCCATCGAGC GCATGCAGAC CGGCAGCTAT
GACGAAGCCC TGACCCTGTT TGAGGAGATA CTCGTCACAC TGCCCGACGA CCCGGCGACG
CTGACCTCAC TCGGGCACGC CCAGAAGACA CTTGGCCAGC ATGACGCCGC GGTTGCCAGC
TACCGGGCGG CCTTTGCAGC CAAACCTGAC CATGGCGACG CCTGGTACGG GCTGGCCAAT
CTGAAGACCT ATCGCTTCAC CGATGAAGAG GTCGCGTCCA TGCAGGCGCT GGAAGCTGGC
AGCGATCTGG CTTTCCAGGA CCGGGTCCAT CTCAGCTTTG CCCTCGCCAA GGCCTTCGAG
GATCACGAAG ACGTCGCGCA GGCGTTCGAC TTCTACGAAA AGGGCAATAC GCTCAAGCGG
GTCCAGACCC GCTACACCAC CGAGCAGATG AAGGCCGAGC TCGATGCCCA GGCCGAAATC
TGTGACGCGG CGCTGTTTGC CCGCCAGTCG GGCAAGGGAT GCGCCGATCC TGATCCAATC
TTCATTGTTG GACTGCCCAG GGCCGGCTCC ACCCTGCTGG AACAGATCCT GGCCTCGCAC
AGCCAGGTCG ACGGCACGTT GGAACTTCCC AACATCCTGG CCCTGTCGCA TCGATTGCGC
GGACGACAGC GGTTGAGTGA CAAGACTCGC TATCCTCGGG TGCTCCACGA GCTGGATGCC
GGGCAACTGG AAGCGCTGGG CCGGGACTAT ATCGAAAACA CCCGCATCCA TCGTGCCGGT
GCGCCGCGCT TCACCGACAA GATGCCAAAC AATTTCCGGC ATATCGGCCT GATCAAACTG
ATCCTGCCCA ATGCCCGCAT CATTGATGCC CGCCGGCATC CGATGGCCTG CTGTTTCTCC
GGCTTCAAGC AATTATTCGC CGAGGGCCAG GAATTCACCT ACGGGCTCGA GGAAATCGGG
CACTATTACC GCAATTATGT TGCGCTGATG GATCATTGGG ATCGGGTTCT TCCCGGCCAG
ATCCTGCGCG TGAACTATGA GGACGTCGTG TCTGACCTTG ACGGGCAGGT TCGCCGTATT
CTCGACTATT GCGGCCTGCC ATTCGAGCAG GCCTGTATCG ACTTCCACGC GACCGAGCGG
GCGGTCCGGA CGGCCAGTTC GGAACAGGTC CGCCAGCCGA TCTTCGACGC CGGTGTGGCG
CAGTGGAAGA AATTCGAACC CCATCTCGAC CCGCTGAAAA GCGCGCTGGG TAAAGACATA
CTGGCCCGAG CGGGACAAGG AACATCATGA
 
Protein sequence
MAEETDIHVQ ALTAAQTQMY EGRFDAARET LQPVLDASPD HVDALYMQAV CARYLKRHDE 
ARAALERIKG VSPDFGRAYQ EEGHLLRALG EDDRALTAYQ RACRFNPALV ASWRAQSDLL
QAAGRPAEAS NAAAQAERIA ALPRDLVSVT HLLHEGKLLK AEQLCRAFLQ KTPHHVEAMR
LLAEIGSRFG VLEDADFLLE SAIGFEPDNT QLRLDYIQIL RKRQKFAAAL EQARQLWETD
RDNPVFKSHY AIERMQTGSY DEALTLFEEI LVTLPDDPAT LTSLGHAQKT LGQHDAAVAS
YRAAFAAKPD HGDAWYGLAN LKTYRFTDEE VASMQALEAG SDLAFQDRVH LSFALAKAFE
DHEDVAQAFD FYEKGNTLKR VQTRYTTEQM KAELDAQAEI CDAALFARQS GKGCADPDPI
FIVGLPRAGS TLLEQILASH SQVDGTLELP NILALSHRLR GRQRLSDKTR YPRVLHELDA
GQLEALGRDY IENTRIHRAG APRFTDKMPN NFRHIGLIKL ILPNARIIDA RRHPMACCFS
GFKQLFAEGQ EFTYGLEEIG HYYRNYVALM DHWDRVLPGQ ILRVNYEDVV SDLDGQVRRI
LDYCGLPFEQ ACIDFHATER AVRTASSEQV RQPIFDAGVA QWKKFEPHLD PLKSALGKDI
LARAGQGTS