Gene Mmar10_2733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_2733 
Symbol 
ID4286046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp3004234 
End bp3005751 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content63% 
IMG OID638142232 
Producttryptophan halogenase 
Protein accessionYP_757957 
Protein GI114571277 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACAA CACCGAAGGG CCCGGCGCCC GGATTGCCCC GCATTGCCAT TATCGGCGGA 
GGCTCGGCAG GATGGATGAC GGCGGCCGCG ATCATCAATG CCACGAAGGG CGCGGCCTCG
CTCACCCTGG TTGAGTCAGA ACAGATCGGT GTGGTCGGGG TGGGGGAAGC GACTATCCCG
CCGATCAAGC TGTTTAACCA GATGCTCGGG ATCGACGAAA ATGACTTCGT GCGAGCCACC
AATGGTTCTT TCAAGCTGGG CATCGAGTTT GTCGACTGGT CCCGCAAGGG GCAGCGCTAT
TTCCATCCGT TCGGCACACA TGGCCGGGAT TTCGACTCCG TCCCGCTCTA TCAATACTGG
TTGCGCGAGC GTAAGCGGGG CGATGACACC CCGCTCGACG CCTATTCAAT GGCCTGGGAG
ATAGCCCGGC AAAACCGCTT CTCACCCCCC GCCAGGGACC CTCGCCTGGT GCAGTCTACC
TTCGACTACG CCTATCATTT CGATACCATC TTGTACGGTC AGTTCCTGCG GCGTTATGCC
GAAACGCGTG GCGTGGTGCG CCAGGAAGGC CGTGTGGTCG ATACGCGCCG GACCGAGACG
GGCGACGTGG AGGCGGTGAT GCTCGAAGGC GGGCGCGCCG TGGAGGCCGA CTTCTTTATT
GATTGCACCG GGTTTTTCGG CCTCTTGATC GAGCAGGTCC TGGAGACCGG CTATGAGGAC
TGGACCCATT GGCTGCCTTG CGACCGCGCC GTCGCAGTGC CCTGTGAGGG CGTCGGTGAT
TTCACGCCCT ATACCCGGTC GACCGCCCGC GAGGCCGGCT GGCAGTGGCG CATCCCGCTT
CAGCACCGGA CGGGCAATGG CCATGTCTAT GCCAGCCAGT TCATCAGCGA TGAAGCGGCA
ACCGACACCC TGCTCGCCAA TCTGGACGGC GAGCCGCTGG CCGATCCGCG GCTCTTGCGC
TTCACCACTG GCCGGCGGCG AAAATTCTGG AACCGCAATG TCGTCGCGCT CGGTCTTTCA
GCCGGGTTCA TGGAACCGCT TGAGTCGACC AGCCTCCATC TAATCCAGAC AGGCATCAAT
CGCTTGTTGG CTTTGTTTCC TGGCACCGGA GACACACAGA AGGAGGCTGC CGAATTCAAT
CGGCTGACCG GAGAGGAATA CGAGCGCATC CGCGACTTCC TGATCCTCCA CTATCATGCC
ACGACCCGGG ATGACGCGCC GCTCTGGCGC CACACAGCCA ACATGGCGAT CCCCGACAGC
CTCGCCTGGC GCATGGAACA CTATCGCGCC AATGGTCGCC TCGTATCGCC CGGGACCGAA
CTGTTTCTCA ATCCGTCCTG GATGGCCGTC TATGCCGGTC AGGAGATCGA ACCGGCCGGC
CTGGACCCGT TGGCGGCCGC CAGTCCGGTG GACGGTGCGC AACGACTCGC CGGATTGCGC
AGGGTGATGG CGGAGGCCAC GGCGCCGGTT CCCGATCACC GGGACTATAT CGAGCGCTTC
TGCACAGCTG CGGTCTAG
 
Protein sequence
MSTTPKGPAP GLPRIAIIGG GSAGWMTAAA IINATKGAAS LTLVESEQIG VVGVGEATIP 
PIKLFNQMLG IDENDFVRAT NGSFKLGIEF VDWSRKGQRY FHPFGTHGRD FDSVPLYQYW
LRERKRGDDT PLDAYSMAWE IARQNRFSPP ARDPRLVQST FDYAYHFDTI LYGQFLRRYA
ETRGVVRQEG RVVDTRRTET GDVEAVMLEG GRAVEADFFI DCTGFFGLLI EQVLETGYED
WTHWLPCDRA VAVPCEGVGD FTPYTRSTAR EAGWQWRIPL QHRTGNGHVY ASQFISDEAA
TDTLLANLDG EPLADPRLLR FTTGRRRKFW NRNVVALGLS AGFMEPLEST SLHLIQTGIN
RLLALFPGTG DTQKEAAEFN RLTGEEYERI RDFLILHYHA TTRDDAPLWR HTANMAIPDS
LAWRMEHYRA NGRLVSPGTE LFLNPSWMAV YAGQEIEPAG LDPLAAASPV DGAQRLAGLR
RVMAEATAPV PDHRDYIERF CTAAV