Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1837 |
Symbol | |
ID | 8534995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 1971740 |
End bp | 1972951 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 646384218 |
Product | homoserine O-acetyltransferase |
Protein accession | YP_003263706 |
Protein GI | 261856423 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2021] Homoserine acetyltransferase |
TIGRFAM ID | [TIGR01392] homoserine O-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGTGATG CATCTGAGAA ATTAGCGCAA TCCGTCTTAA GCCGATCTGT GGGGATCGTC GAGCCGAAGA CGGCCCGATT TTCCGAGCCG TTGGCCTTGG ATTGTGGTCG TTCACTGCCC TCCTATGAGC TGGTTTACGA AACCTACGGT CAGCTTAATG ATGAGGGCAG TAATGCGGTG CTGATCTGCC ATGCCTTGTC GGGCGATCAC CATGCGGCAG GTTTCCACGC AGAAACCGAC CGCAAGCCGG GGTGGTGGGA TTCGGCCATC GGGCCCGGGA AACCGATCGA TACGGATCGG TTTTTTGTGG TGTGTCTGAA CAACTTGGGT GGCTGCAAGG GATCGACCGG CCCGTTAAGC GTCGATCCGG CCAGCGGCAA ACCCTACGGG CCGGACTTTC CGATCGTGAC GGTGAAAGAC TGGGTGCACG CCCAATATCG ACTTATGCAG TACCTGGGGT TGTCCGGTTG GGCGGCGGTG ATCGGTGGCA GTTTGGGCGG CATGCAGGTT TTGCAGTGGT CGATTACTTA TCCTGATGCA GTCGCCCATG CTGTCGTAAT CGCTGCGGCT CCGCGCTTGT CGGCGCAAAA CATTGCGTTC AACGAAGTGG CGCGTCAGGC CATCATCACC GATCCGGAGT TTTATGGCGG GCGTTACGCG GATCACAATG CATTGCCGCG CCGGGGGCTG ATGCTTGCCC GGATGCTGGG GCACATTACG TACTTATCCG ACGATGCAAT GCGAGCCAAG TTCGGTCGGG AACTGCGCGC GGGGCAGGTG CAGTACGGGT TCGATGTCGA GTTTCAGGTT GAAAGTTATC TGCGATATCA AGGCACCAGT TTCGTTGATC GTTTCGATGC CAATACGTAC CTGCTGATGA CCAAGGCGTT GGATTACTTC GATCCGGCAC AGGCCAGTAA TGATGATCTG GTCGCCGCAC TGGCCGAGGT TAAGGCGCAT TTCCTTGTGG TTTCGTTTAC ATCGGACTGG CGTTTCTCGC CCGAGCGATC CCGGGAGATC GTGCGTGCCC TGCTGGCGTC CGGAAAGCAG GTTTCTTATG CCGAAATCGA GTCGAATCAC GGCCATGACG CCTTCCTGAT GACGATTCCC TACTACCACC GTGTACTGGC AGGTTATATG GCCAATATCG ATTTCGCCTC CACGCCGCGC GGCGTTTCGA GCCCGGTATA CAGCACGGGG GGTGCCGTAT GA
|
Protein sequence | MSDASEKLAQ SVLSRSVGIV EPKTARFSEP LALDCGRSLP SYELVYETYG QLNDEGSNAV LICHALSGDH HAAGFHAETD RKPGWWDSAI GPGKPIDTDR FFVVCLNNLG GCKGSTGPLS VDPASGKPYG PDFPIVTVKD WVHAQYRLMQ YLGLSGWAAV IGGSLGGMQV LQWSITYPDA VAHAVVIAAA PRLSAQNIAF NEVARQAIIT DPEFYGGRYA DHNALPRRGL MLARMLGHIT YLSDDAMRAK FGRELRAGQV QYGFDVEFQV ESYLRYQGTS FVDRFDANTY LLMTKALDYF DPAQASNDDL VAALAEVKAH FLVVSFTSDW RFSPERSREI VRALLASGKQ VSYAEIESNH GHDAFLMTIP YYHRVLAGYM ANIDFASTPR GVSSPVYSTG GAV
|
| |