Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_2219 |
Symbol | |
ID | 8535383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 2388124 |
End bp | 2389149 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 646384599 |
Product | 2OG-Fe(II) oxygenase |
Protein accession | YP_003264081 |
Protein GI | 261856798 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.127604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCATA CCCATTTCAG ATCCATTCCT CTAATCGATA TATCCGGGTT ATATGACGAT AGCCTGGTCG AACGCCAACG CGTTGCCGAT GAGCTGGGCC GAGCGGCACA CGATGTCGGC TTCCTGCAGA TTACCGGGCA CGGCATTTCC CGCTCGTTAC GCGATGGATT GATCCGACAG GCGCGCAGAT TTTTCGAGCG CCCGCTGCAC GAGAAGATGC GCTTCTACAT CGGCCAGTCG AGTAACCACA GTGGTTATGT GCCGGAAGGC GAGGAACAGT TCGCGGGCGG CGGCAAGGAT CTCAAGGAAG CTTATGATGT CAATTACAAC TACACAGAGG CAGCACAGAT CTATCCGCTG CTGGGCCCGA CTCAATGGCC TGATTCGGCG GATTTCAGGC TAGAGGTGGG TGCCTATTAC CGCGCGGCGC TGGCGCTGGG CGACACGCTG TTTCGTGGTT TTGCCCTGGC GCTGGGATTG GCGGAGGAGA CCTTCGCCAA GATCACCCGT CACCCCACCA GCCAGTTGCG ACTGATCCAC TACCCGCTCG ACCCTGATCC GGTGGCGGAC CGTCCCGGTA TTGGCGCGCA CACCGACTAT GAGTGCTTCA CCATTCTTTT ACCCACAGCT GAGGGGCTTC AGGTACTCAA TGGGAATGGT CAATGGATCG ATGTGCCACT GGTGGAAGAT GCCTTCGTGA TCAACATCGG CGACATGCTC GAAGTGCTCA GCAACGGCCA TTTCGTGGCT ACCTCGCATC GCGTACGCAA GGTCAGCGAG GAGCGCTTCG CCTTCCCACT ATTCTGCGCC TGTGACTACA CGACTCGTAT TGCTCCGATA GCCGGACTTC CCCCGCGCGG TGAGCGCCGA TACGAGCCGA TCAACTGTGG CGACCACCTG TTCGCCCAGA CCGCGCAGAC CTTCCGTTAT CTGCGCGAGC GGTTGGAGGA TGGTTCGCTG CAACTGCCCG ATGGCGCGGC AGGCTTGTCC AGTTTCGGTC ATGGATACCG TGCGGAGAAG GCATGA
|
Protein sequence | MAHTHFRSIP LIDISGLYDD SLVERQRVAD ELGRAAHDVG FLQITGHGIS RSLRDGLIRQ ARRFFERPLH EKMRFYIGQS SNHSGYVPEG EEQFAGGGKD LKEAYDVNYN YTEAAQIYPL LGPTQWPDSA DFRLEVGAYY RAALALGDTL FRGFALALGL AEETFAKITR HPTSQLRLIH YPLDPDPVAD RPGIGAHTDY ECFTILLPTA EGLQVLNGNG QWIDVPLVED AFVINIGDML EVLSNGHFVA TSHRVRKVSE ERFAFPLFCA CDYTTRIAPI AGLPPRGERR YEPINCGDHL FAQTAQTFRY LRERLEDGSL QLPDGAAGLS SFGHGYRAEK A
|
| |