Gene Mmar10_2721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_2721 
Symbol 
ID4286086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp2984854 
End bp2986377 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content60% 
IMG OID638142220 
Producttryptophan halogenase 
Protein accessionYP_757945 
Protein GI114571265 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000284489 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0465531 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGTCAAG ATGCCCCAAG GCGGATCGTC GTTGTCGGCG GAGGCACCGC AGGATGGATG 
GCTGCGGCCG CGCTGGTGTC GGTTTTGCCC AGCCAGCGCG TCCAGGTCAC CCTGGTCGAA
TCCGAAGCAA TCGGCATCAT CGGTGTGGGC GAGGCGACGC TTCCCCATCT GCGCCATTTC
AATGAAACCC TTGGCATCAA CGAGGCCGAT TTCATCAAGG CGACGTCCGC CACGCTGAAG
CTCGGCATCG AGTTCGTGAA CTGGGCCCGA AAGGGCGACA GCTATGTGCA CCCATTCGGC
GATTTCGGGA CCGAGATTGC CGGCCTGCCC TTTCATCAAG CCTGGACCCG GATGCGGGCC
GCCGGCAAGG CCCGGGATAT CGGTGCCTAC TCGCTTCCCG TCCGCATGTG CGCGGCAAAC
CGGTTCGACA GACCCGCAGA AGACCCGGCC GATTTTGCAT CCCGCTTCGG CTATGCCTAC
CAGTTCGACG CCACCCGTTA TGCGCCCTTC CTGCGCCAGC ATGCCGAAGC CCGCGGCGCG
ACCCGAATTG AGGGCATTGT CGACACGGTT CATTGCGATC CTGAAACCGG CGATATCGAG
CGGCTCGACC TGAAGGACGG GCAAGAGATC GAAGGCGATT TCTTCTTTGA CTGCACCGGA
TTCCGAGGCG TTCTGATCGA GCAGGCGTTG AATGTGGGTT ATGAGGACTG GTCACATTGG
CTGCCGTGCA ACCGGGCTAT CGCCCTGCCT AGCGAAAAAT CCGGACCAAC CCCGCCCTAC
ACGCGTGCAA CCGCACATCA GGCGGGCTGG CTATGGCGGA TTCCGCTGCA GCACCGCACC
GGGAACGGGC ATGTCTATGC CAGCGATTTC ATCGATGATG AGACGGCCCG TCAGACGCTG
CTCGACAATC TGGAAGGCGC GCCCCTGGCT GATCCTCGAC CGCTGCGTTT CACAACCGGC
AGGCGTAAAC AATTCTGGGC TCATAATTGC GTCAGCATCG GGCTGGCTGG CGGTTTCCTT
GAACCGCTTG AATCGACGAG CATCCATCTA ACGCAGATCG CAATCACGCA ATTCATTGAA
CTGTTTCCGG TAGATAACGA TTACACGCTT GAGCGAGAAA GCTACAACGC GCACATGACG
CGGGAATTCG AGCGCGTGCG CGACTTCCTC ATCCTCCACT ATCATGCGAC CGAACGGACT
GATTCCGAGT TCTGGAACTA CGTCCGCACC ATGCCGGTAC CGGATTCGCT GACGGAAAAA
ATGGCCCTGT TTCGGCAAAC CGGGCGTGTC GGTCGATATC AGCAAGGGCT ATTTTTGGAG
CCCAGCTGGC TCGCCGTCTA TCTCGGTCAG CGAATCGTCC CGCAGAGCTG GGATGGGCGA
TTGGACACGA TCCCGGAAGA ATCCCTCGGT CAGTCGCTGA CAACAATCGA GTCGCAGATC
GATCAGGCGA TGTCCCGTAT GCCCGACCAT GACACCTGGC TGGCCAATCT GGCCGGCGTC
GAGGCGGAGA CGCGTCATGG CTGA
 
Protein sequence
MGQDAPRRIV VVGGGTAGWM AAAALVSVLP SQRVQVTLVE SEAIGIIGVG EATLPHLRHF 
NETLGINEAD FIKATSATLK LGIEFVNWAR KGDSYVHPFG DFGTEIAGLP FHQAWTRMRA
AGKARDIGAY SLPVRMCAAN RFDRPAEDPA DFASRFGYAY QFDATRYAPF LRQHAEARGA
TRIEGIVDTV HCDPETGDIE RLDLKDGQEI EGDFFFDCTG FRGVLIEQAL NVGYEDWSHW
LPCNRAIALP SEKSGPTPPY TRATAHQAGW LWRIPLQHRT GNGHVYASDF IDDETARQTL
LDNLEGAPLA DPRPLRFTTG RRKQFWAHNC VSIGLAGGFL EPLESTSIHL TQIAITQFIE
LFPVDNDYTL ERESYNAHMT REFERVRDFL ILHYHATERT DSEFWNYVRT MPVPDSLTEK
MALFRQTGRV GRYQQGLFLE PSWLAVYLGQ RIVPQSWDGR LDTIPEESLG QSLTTIESQI
DQAMSRMPDH DTWLANLAGV EAETRHG