Gene Mmar10_2723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_2723 
Symbol 
ID4286088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp2987647 
End bp2989176 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content61% 
IMG OID638142222 
Producttryptophan halogenase 
Protein accessionYP_757947 
Protein GI114571267 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.189353 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.019057 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGCA CCATCTCGAA AGTTGTAATC GCCGGAGGCG GTACCGCTGG CTGGATGACG 
GCGGCGGCCC TGTCCCGCTT TCTTGTGCCT TCCGGCGTCA CAGTCGAACT GGTGGAAAGC
GAACAGATCG GAACGGTTGG GGTTGGCGAG GCGACCATCC CGGGCATCAT CGACTTCAAC
CGCATGCTCG GAATTGACGA GGCGGACTTC ATTGCCGCCA CCAAGGGGAC GTTCAAGCTG
GGCATTGAAT TTGTCGACTG GGACCGGGTC GGAAATCGCT ATCTTCACCC GTTCGGTGAA
TACGGCTTCG ACCTGGAGGG CGTCCCTTTC CATCATTACT GGCTGCGCGA CCGGTTGCGC
GGGAGTGATC ACCCGCTGTC GGCCTACTCC ATGTGTTGCC AGGCGGCCAT GTCCGGGAAA
TTCATGCGGC CGGTGAGCGA TCCGCAATCG CCGGTCGCCC AAATGCGTCA CGCCTACCAT
TTCGATGCCG GGCTCTACGC CCGCTACTTG CGCAACTATG CGGAACAGCG TGGCGTGACG
CGTGTGGAAG GCCGGATCAA GGCGGTTGAT CAATCAACAG AGACCGGCTC GCTGACCGCG
CTCGAGCTTG AGAATGGCAG CCGGATCGAG GGCGACATCT TCGTGGATTG CACCGGCTTT
CGAGCCTTGT TGATCGGCGA AACGCTGGGC GTCGACTATG ATGACTGGCG CCGCTACCTG
CCATGCGACC GGGCAATTGC CGTGCCCTGC GAAAAGATCG GGGCCGCCGC CCCGTACACA
CGCGCAACGG CCCGTGAGGC CGGCTGGCAA TGGCGTATTC CCCTGCAGCA CCGGACCGGT
AACGGGTATG TCTATTCGTC CTCCTTCCTG AACGATGACG AGGCAGAGAG TGCACTCCTG
GCCAATCTCG ACGCTCCGAC GACCGGCCCG ACCAACAGGC TGCGTTTCAC CCCGGGACGC
CGCCGCTCGG TGTGGAAGAA GAACTGTGTC GCGATCGGCT TGTCCGCCGG CTTCCTTGAA
CCTCTGGAAT CGACCAGCAT CCATCTCATC CAGGAGGGCG TCAGCAAGCT GCTGGCCCTG
TTCCCGCGGG GCGGGATCAA CCAACGCGAG GTCACCCGCT ACAATTCGAT TATCGGCAAT
GCCTATGACT ATGTGAGGGA TTTCCTGATC CTCCACTACA ACGCGACCAC GCGCGACGAT
ACGCCATTCT GGGACTATGT GCGGACAATG GCCGTGCCCG ACAGCCTCAC GGAAACGGTT
GAACTCTTCG CAGAAAACGG GCGCTTCTTC GCCCACAAGA GCGACCTGTT TAGCATCACG
TCCTGGGTCG CGGTGATGAT CGGCCAGGGA ATATTGCCAC GCGGCTATGA TCCGGTGGCC
GACTCCATCC CGGACCAGGA TCTCGTCGCC ACGCTCACCA ACATGCGCGA GATCTATGCC
CAGGCGGCTT CGAAAATGCC GCCACATCAG GCCTTTATCG ACCATCTTGC AAAATCCGCC
CGCCAAGGAG GCCAGGCCCA TGCCCGTTGA
 
Protein sequence
MTGTISKVVI AGGGTAGWMT AAALSRFLVP SGVTVELVES EQIGTVGVGE ATIPGIIDFN 
RMLGIDEADF IAATKGTFKL GIEFVDWDRV GNRYLHPFGE YGFDLEGVPF HHYWLRDRLR
GSDHPLSAYS MCCQAAMSGK FMRPVSDPQS PVAQMRHAYH FDAGLYARYL RNYAEQRGVT
RVEGRIKAVD QSTETGSLTA LELENGSRIE GDIFVDCTGF RALLIGETLG VDYDDWRRYL
PCDRAIAVPC EKIGAAAPYT RATAREAGWQ WRIPLQHRTG NGYVYSSSFL NDDEAESALL
ANLDAPTTGP TNRLRFTPGR RRSVWKKNCV AIGLSAGFLE PLESTSIHLI QEGVSKLLAL
FPRGGINQRE VTRYNSIIGN AYDYVRDFLI LHYNATTRDD TPFWDYVRTM AVPDSLTETV
ELFAENGRFF AHKSDLFSIT SWVAVMIGQG ILPRGYDPVA DSIPDQDLVA TLTNMREIYA
QAASKMPPHQ AFIDHLAKSA RQGGQAHAR