Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_2237 |
Symbol | |
ID | 8535401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 2408980 |
End bp | 2410314 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 646384617 |
Product | tryptophan halogenase |
Protein accession | YP_003264099 |
Protein GI | 261856816 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACGAG TAGAAACATC AGAACCGGTT CACCATTGCG ACGTATTGGT CATTGGCGGT GGCCCAGCGG GTTCGACCGT CGCGCCGTTA TTGGTCGAGC GAGGCTATCG GGTGGTTGTG CTGGAAAAGG CACATCACCC GCGTTTTCAC ATTGGCGAAT CGTTGCTGCC CGCCAATATG CCCTTGTTCG ACCGCTTGGG TGTCGGTGAA GAAATTCGTG CCGTCGGCAT GGAAAAATGG GGCGCGGAAT TCGTATCGCC GCACCACGAC CACAAGCAGG TGTTTGAATT TGCCGAGGCT TGGGATAAGT CCATGCCGAT GGCCTATCAG GTGCCGCGCG CCCAGTTTGA CGAAATTCTG ATCCGCAATG CTCGTCAAAA AGGCGCGGAA GTGATCGAAG GCTGCCGTGC CAAGAATGTG GAGTTCCTGC CGGACGATCA TCCGAGTGGC CCTGGCGCGC GCGTTACCGC CGTGATGGAC GATGGCACCC AAACCGAATG GAAAACGAAG TTCGTGGTCG ATGCCTCGGG GCGCGACACG TTCCTTGCCA ACAAGATGCA GTCCAAACAG CGCAACCCCA AGCACAACAG TTCGGCGGTG TACGGCCATA TGCGTGGCGC GCTGCGCAAT GAAGGCAAGG CCGAAGGCAA TATCACCATT TTCTGGTTCG ATCACGGCTG GTTCTGGTTT ATTCCGCTGA TGGACGGCAT CACGAGCATC GGCATGGTGA CCTGGCCCTA CCACATGAAA TCGCGCGGTG ACCGCAGTCT CGAGCAGTTC CTGCGTGACA ACATCGAGTC CTGCGCACCG CTGGCCGAAC AATTGCGCGG TGCCGAATTT GTGAACAACG TCGAGGCCAC GGGCAATTAT TCGTACCTGT CCGACCGTAC GCATGGGAAC AACTACGTGC TGCTGGGCGA TGCCTTCGCC TTTATCGATC CGGTCTTTTC CTCTGGCGTT TTGCTCGCTA TGCAAAGTGC CGTCTTGGCA ACCGATGCGA TTGATACGGC CTTGCAGCAC CCCGCCAAAG CGGCTGCGGC ACTGAAGAAG TTCGATAAAC AGATACGGAT GGGGCCGCGC GAGTTTTCGT GGTTCATTTA CCGTGTGACC AATCCGGCGA TGCGCGATCT GTTCATGGGG CCGCGCAATA TTTTCCGCGT CAAGGAAGCG CTGCTTTCCC TGCTGGCGGG CGATATTTAC GGCAAGACGC CCATCTGGAC TTCGTTGCGT ATGATGAAGG TAATCTATAC CGTCGCCCTG CTCAGAAACC CCCTGCGTGC CTACCGGGCA TGGCAGGCGC GTAGGTTTAA TATTCGTCCC GAACTGGATG CGTGA
|
Protein sequence | MPRVETSEPV HHCDVLVIGG GPAGSTVAPL LVERGYRVVV LEKAHHPRFH IGESLLPANM PLFDRLGVGE EIRAVGMEKW GAEFVSPHHD HKQVFEFAEA WDKSMPMAYQ VPRAQFDEIL IRNARQKGAE VIEGCRAKNV EFLPDDHPSG PGARVTAVMD DGTQTEWKTK FVVDASGRDT FLANKMQSKQ RNPKHNSSAV YGHMRGALRN EGKAEGNITI FWFDHGWFWF IPLMDGITSI GMVTWPYHMK SRGDRSLEQF LRDNIESCAP LAEQLRGAEF VNNVEATGNY SYLSDRTHGN NYVLLGDAFA FIDPVFSSGV LLAMQSAVLA TDAIDTALQH PAKAAAALKK FDKQIRMGPR EFSWFIYRVT NPAMRDLFMG PRNIFRVKEA LLSLLAGDIY GKTPIWTSLR MMKVIYTVAL LRNPLRAYRA WQARRFNIRP ELDA
|
| |