Gene Information |
Locus tag | Mmar10_2723 |
Symbol | |
ID | 4286088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 2987647 |
End bp | 2989176 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638142222 |
Product | tryptophan halogenase |
Protein accession | YP_757947 |
Protein GI | 114571267 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
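The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2987647
end_bp = 2989176
gene_length_bp = 1530
protein_length_aa = 509

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codons = span // 3
expected_protein_length = codons - 1

print("span matches Gene Length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("codons minus stop matches Protein Length:",
      expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 1530 bp is 510 codons, and dropping the stop codon leaves 509 residues.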
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.189353 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.019057 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGCA CCATCTCGAA AGTTGTAATC GCCGGAGGCG GTACCGCTGG CTGGATGACG GCGGCGGCCC TGTCCCGCTT TCTTGTGCCT TCCGGCGTCA CAGTCGAACT GGTGGAAAGC GAACAGATCG GAACGGTTGG GGTTGGCGAG GCGACCATCC CGGGCATCAT CGACTTCAAC CGCATGCTCG GAATTGACGA GGCGGACTTC ATTGCCGCCA CCAAGGGGAC GTTCAAGCTG GGCATTGAAT TTGTCGACTG GGACCGGGTC GGAAATCGCT ATCTTCACCC GTTCGGTGAA TACGGCTTCG ACCTGGAGGG CGTCCCTTTC CATCATTACT GGCTGCGCGA CCGGTTGCGC GGGAGTGATC ACCCGCTGTC GGCCTACTCC ATGTGTTGCC AGGCGGCCAT GTCCGGGAAA TTCATGCGGC CGGTGAGCGA TCCGCAATCG CCGGTCGCCC AAATGCGTCA CGCCTACCAT TTCGATGCCG GGCTCTACGC CCGCTACTTG CGCAACTATG CGGAACAGCG TGGCGTGACG CGTGTGGAAG GCCGGATCAA GGCGGTTGAT CAATCAACAG AGACCGGCTC GCTGACCGCG CTCGAGCTTG AGAATGGCAG CCGGATCGAG GGCGACATCT TCGTGGATTG CACCGGCTTT CGAGCCTTGT TGATCGGCGA AACGCTGGGC GTCGACTATG ATGACTGGCG CCGCTACCTG CCATGCGACC GGGCAATTGC CGTGCCCTGC GAAAAGATCG GGGCCGCCGC CCCGTACACA CGCGCAACGG CCCGTGAGGC CGGCTGGCAA TGGCGTATTC CCCTGCAGCA CCGGACCGGT AACGGGTATG TCTATTCGTC CTCCTTCCTG AACGATGACG AGGCAGAGAG TGCACTCCTG GCCAATCTCG ACGCTCCGAC GACCGGCCCG ACCAACAGGC TGCGTTTCAC CCCGGGACGC CGCCGCTCGG TGTGGAAGAA GAACTGTGTC GCGATCGGCT TGTCCGCCGG CTTCCTTGAA CCTCTGGAAT CGACCAGCAT CCATCTCATC CAGGAGGGCG TCAGCAAGCT GCTGGCCCTG TTCCCGCGGG GCGGGATCAA CCAACGCGAG GTCACCCGCT ACAATTCGAT TATCGGCAAT GCCTATGACT ATGTGAGGGA TTTCCTGATC CTCCACTACA ACGCGACCAC GCGCGACGAT ACGCCATTCT GGGACTATGT GCGGACAATG GCCGTGCCCG ACAGCCTCAC GGAAACGGTT GAACTCTTCG CAGAAAACGG GCGCTTCTTC GCCCACAAGA GCGACCTGTT TAGCATCACG TCCTGGGTCG CGGTGATGAT CGGCCAGGGA ATATTGCCAC GCGGCTATGA TCCGGTGGCC GACTCCATCC CGGACCAGGA TCTCGTCGCC ACGCTCACCA ACATGCGCGA GATCTATGCC CAGGCGGCTT CGAAAATGCC GCCACATCAG GCCTTTATCG ACCATCTTGC AAAATCCGCC CGCCAAGGAG GCCAGGCCCA TGCCCGTTGA
|
Protein sequence | MTGTISKVVI AGGGTAGWMT AAALSRFLVP SGVTVELVES EQIGTVGVGE ATIPGIIDFN RMLGIDEADF IAATKGTFKL GIEFVDWDRV GNRYLHPFGE YGFDLEGVPF HHYWLRDRLR GSDHPLSAYS MCCQAAMSGK FMRPVSDPQS PVAQMRHAYH FDAGLYARYL RNYAEQRGVT RVEGRIKAVD QSTETGSLTA LELENGSRIE GDIFVDCTGF RALLIGETLG VDYDDWRRYL PCDRAIAVPC EKIGAAAPYT RATAREAGWQ WRIPLQHRTG NGYVYSSSFL NDDEAESALL ANLDAPTTGP TNRLRFTPGR RRSVWKKNCV AIGLSAGFLE PLESTSIHLI QEGVSKLLAL FPRGGINQRE VTRYNSIIGN AYDYVRDFLI LHYNATTRDD TPFWDYVRTM AVPDSLTETV ELFAENGRFF AHKSDLFSIT SWVAVMIGQG ILPRGYDPVA DSIPDQDLVA TLTNMREIYA QAASKMPPHQ AFIDHLAKSA RQGGQAHAR
|
| |