Gene Hneap_2237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_2237 
Symbol 
ID8535401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp2408980 
End bp2410314 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content57% 
IMG OID646384617 
Producttryptophan halogenase 
Protein accessionYP_003264099 
Protein GI261856816 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACGAG TAGAAACATC AGAACCGGTT CACCATTGCG ACGTATTGGT CATTGGCGGT 
GGCCCAGCGG GTTCGACCGT CGCGCCGTTA TTGGTCGAGC GAGGCTATCG GGTGGTTGTG
CTGGAAAAGG CACATCACCC GCGTTTTCAC ATTGGCGAAT CGTTGCTGCC CGCCAATATG
CCCTTGTTCG ACCGCTTGGG TGTCGGTGAA GAAATTCGTG CCGTCGGCAT GGAAAAATGG
GGCGCGGAAT TCGTATCGCC GCACCACGAC CACAAGCAGG TGTTTGAATT TGCCGAGGCT
TGGGATAAGT CCATGCCGAT GGCCTATCAG GTGCCGCGCG CCCAGTTTGA CGAAATTCTG
ATCCGCAATG CTCGTCAAAA AGGCGCGGAA GTGATCGAAG GCTGCCGTGC CAAGAATGTG
GAGTTCCTGC CGGACGATCA TCCGAGTGGC CCTGGCGCGC GCGTTACCGC CGTGATGGAC
GATGGCACCC AAACCGAATG GAAAACGAAG TTCGTGGTCG ATGCCTCGGG GCGCGACACG
TTCCTTGCCA ACAAGATGCA GTCCAAACAG CGCAACCCCA AGCACAACAG TTCGGCGGTG
TACGGCCATA TGCGTGGCGC GCTGCGCAAT GAAGGCAAGG CCGAAGGCAA TATCACCATT
TTCTGGTTCG ATCACGGCTG GTTCTGGTTT ATTCCGCTGA TGGACGGCAT CACGAGCATC
GGCATGGTGA CCTGGCCCTA CCACATGAAA TCGCGCGGTG ACCGCAGTCT CGAGCAGTTC
CTGCGTGACA ACATCGAGTC CTGCGCACCG CTGGCCGAAC AATTGCGCGG TGCCGAATTT
GTGAACAACG TCGAGGCCAC GGGCAATTAT TCGTACCTGT CCGACCGTAC GCATGGGAAC
AACTACGTGC TGCTGGGCGA TGCCTTCGCC TTTATCGATC CGGTCTTTTC CTCTGGCGTT
TTGCTCGCTA TGCAAAGTGC CGTCTTGGCA ACCGATGCGA TTGATACGGC CTTGCAGCAC
CCCGCCAAAG CGGCTGCGGC ACTGAAGAAG TTCGATAAAC AGATACGGAT GGGGCCGCGC
GAGTTTTCGT GGTTCATTTA CCGTGTGACC AATCCGGCGA TGCGCGATCT GTTCATGGGG
CCGCGCAATA TTTTCCGCGT CAAGGAAGCG CTGCTTTCCC TGCTGGCGGG CGATATTTAC
GGCAAGACGC CCATCTGGAC TTCGTTGCGT ATGATGAAGG TAATCTATAC CGTCGCCCTG
CTCAGAAACC CCCTGCGTGC CTACCGGGCA TGGCAGGCGC GTAGGTTTAA TATTCGTCCC
GAACTGGATG CGTGA
 
Protein sequence
MPRVETSEPV HHCDVLVIGG GPAGSTVAPL LVERGYRVVV LEKAHHPRFH IGESLLPANM 
PLFDRLGVGE EIRAVGMEKW GAEFVSPHHD HKQVFEFAEA WDKSMPMAYQ VPRAQFDEIL
IRNARQKGAE VIEGCRAKNV EFLPDDHPSG PGARVTAVMD DGTQTEWKTK FVVDASGRDT
FLANKMQSKQ RNPKHNSSAV YGHMRGALRN EGKAEGNITI FWFDHGWFWF IPLMDGITSI
GMVTWPYHMK SRGDRSLEQF LRDNIESCAP LAEQLRGAEF VNNVEATGNY SYLSDRTHGN
NYVLLGDAFA FIDPVFSSGV LLAMQSAVLA TDAIDTALQH PAKAAAALKK FDKQIRMGPR
EFSWFIYRVT NPAMRDLFMG PRNIFRVKEA LLSLLAGDIY GKTPIWTSLR MMKVIYTVAL
LRNPLRAYRA WQARRFNIRP ELDA