Gene Mmar10_2722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_2722 
Symbol 
ID4286087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp2986370 
End bp2987650 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content63% 
IMG OID638142221 
Producttryptophan halogenase 
Protein accessionYP_757946 
Protein GI114571266 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0792075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.00766625 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCTGAAC CAATCCGGTC GATATGCATT CTGGGAAACG GTCTGGAAGC CTGGCTTACC 
GCGCATGTGC TGTACAAGGC ACTCGGGGGA GAGGCGGTCT CAATCCATAT CCAACCGCTC
GACTCAAGGA TCCATGACGA CTGCGCCTAT ACGATCTTGC GCCCATCGGC GCTGGTCGCT
CTGGACCAGT TCGGTCTGAA CCTGCAGGGC CTTCTGCGTC TGCCGGCGAC CCTGCCCAGC
CTGGGTCAGG TCCTTCGTTC TGCAAGCGGC GTCGACACGC TCCTGCCCTA TGGCGGGCAA
GGTGTCGATT GGGCCGGGAC CAGTTTTCAC CATCATTGGC TTCGCGCACG CCAGGCCGGT
TTGCGCCATC CCTATTTCGC CTTTTCGCCC GGCTATCACG CCATGGCGTC AGACCGCTTT
GCCCCGCCCG ACAGACGCAA CGCGATAGGG CCGATGCAGC ATGAAAGCGG CTTGCACGTC
AGTACTCGGG AGTTGACGGA AAACCTGCGG CTCAATTTGC AGGCCAACAT TATCGTCCTC
GATCCGACAG CCAATTGCGA ACAGGCTGAT CTTGTCATCC ATGCGCCCGG GGCTCCGGAC
CGGAATTCTA CGCTGGACCC GGTTCACGCT GGCCCGCCCA AACCCTACGC CGTGCGAGTC
GAGGCTGGCG GCGAATCGCG CCTGCAAATC CCGCTGCGTT CCGGCTGGTT GGACCTGCCG
ACGCCTGCGG GACCGGCCAA ACGTCATTGT GGCAACTCAC CCTGGGGCGA GGATGGCCTG
GTCGTCGGGC TCGCGGCCGC CCACCTGCCC GGCCTTGAAG ACCGGTCCAT GGACCGGCTT
CTCTTCGAGC TCGAAACCTT GCTGGAACTC TGGCCACGGT CCGGTGTCCA TTCCGCGGAG
GCCCTCGAAT ACAACCGCCT TTGGGCGCAG GAGGCCGATG AATGGACAGC ACTGAGCGCG
CTGTCGGCCG ACCAGGAAGA CCAACGTTGT CAGGCGCGGA AGGCTGTGTT CCGCCGACGC
GGTTATATCG AGCCGCTGGA AAGCCGCACG ATCACTCCGG AGGACTGGGT GGAGGCCTTT
ATTGGGCGCG GTGTGATCCC TGCCCATTAC GACCGGTTGA GTGAGCGTTT GACCGACCCG
CAGCTCAAGT CCGAGCTGCA AAAATTCACC GATGCGGTCG GTCGGACCGT CCGCGAGTTT
CCCAGTTTCC CGAGCTATCT ACGGGCGATC GACCGAGCGG TCGGGCCGGC CACCGATGCC
AAGACGGAGC CATCCGCATG A
 
Protein sequence
MAEPIRSICI LGNGLEAWLT AHVLYKALGG EAVSIHIQPL DSRIHDDCAY TILRPSALVA 
LDQFGLNLQG LLRLPATLPS LGQVLRSASG VDTLLPYGGQ GVDWAGTSFH HHWLRARQAG
LRHPYFAFSP GYHAMASDRF APPDRRNAIG PMQHESGLHV STRELTENLR LNLQANIIVL
DPTANCEQAD LVIHAPGAPD RNSTLDPVHA GPPKPYAVRV EAGGESRLQI PLRSGWLDLP
TPAGPAKRHC GNSPWGEDGL VVGLAAAHLP GLEDRSMDRL LFELETLLEL WPRSGVHSAE
ALEYNRLWAQ EADEWTALSA LSADQEDQRC QARKAVFRRR GYIEPLESRT ITPEDWVEAF
IGRGVIPAHY DRLSERLTDP QLKSELQKFT DAVGRTVREF PSFPSYLRAI DRAVGPATDA
KTEPSA