Gene Hneap_1837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1837 
Symbol 
ID8534995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1971740 
End bp1972951 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content57% 
IMG OID646384218 
Producthomoserine O-acetyltransferase 
Protein accessionYP_003263706 
Protein GI261856423 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGTGATG CATCTGAGAA ATTAGCGCAA TCCGTCTTAA GCCGATCTGT GGGGATCGTC 
GAGCCGAAGA CGGCCCGATT TTCCGAGCCG TTGGCCTTGG ATTGTGGTCG TTCACTGCCC
TCCTATGAGC TGGTTTACGA AACCTACGGT CAGCTTAATG ATGAGGGCAG TAATGCGGTG
CTGATCTGCC ATGCCTTGTC GGGCGATCAC CATGCGGCAG GTTTCCACGC AGAAACCGAC
CGCAAGCCGG GGTGGTGGGA TTCGGCCATC GGGCCCGGGA AACCGATCGA TACGGATCGG
TTTTTTGTGG TGTGTCTGAA CAACTTGGGT GGCTGCAAGG GATCGACCGG CCCGTTAAGC
GTCGATCCGG CCAGCGGCAA ACCCTACGGG CCGGACTTTC CGATCGTGAC GGTGAAAGAC
TGGGTGCACG CCCAATATCG ACTTATGCAG TACCTGGGGT TGTCCGGTTG GGCGGCGGTG
ATCGGTGGCA GTTTGGGCGG CATGCAGGTT TTGCAGTGGT CGATTACTTA TCCTGATGCA
GTCGCCCATG CTGTCGTAAT CGCTGCGGCT CCGCGCTTGT CGGCGCAAAA CATTGCGTTC
AACGAAGTGG CGCGTCAGGC CATCATCACC GATCCGGAGT TTTATGGCGG GCGTTACGCG
GATCACAATG CATTGCCGCG CCGGGGGCTG ATGCTTGCCC GGATGCTGGG GCACATTACG
TACTTATCCG ACGATGCAAT GCGAGCCAAG TTCGGTCGGG AACTGCGCGC GGGGCAGGTG
CAGTACGGGT TCGATGTCGA GTTTCAGGTT GAAAGTTATC TGCGATATCA AGGCACCAGT
TTCGTTGATC GTTTCGATGC CAATACGTAC CTGCTGATGA CCAAGGCGTT GGATTACTTC
GATCCGGCAC AGGCCAGTAA TGATGATCTG GTCGCCGCAC TGGCCGAGGT TAAGGCGCAT
TTCCTTGTGG TTTCGTTTAC ATCGGACTGG CGTTTCTCGC CCGAGCGATC CCGGGAGATC
GTGCGTGCCC TGCTGGCGTC CGGAAAGCAG GTTTCTTATG CCGAAATCGA GTCGAATCAC
GGCCATGACG CCTTCCTGAT GACGATTCCC TACTACCACC GTGTACTGGC AGGTTATATG
GCCAATATCG ATTTCGCCTC CACGCCGCGC GGCGTTTCGA GCCCGGTATA CAGCACGGGG
GGTGCCGTAT GA
 
Protein sequence
MSDASEKLAQ SVLSRSVGIV EPKTARFSEP LALDCGRSLP SYELVYETYG QLNDEGSNAV 
LICHALSGDH HAAGFHAETD RKPGWWDSAI GPGKPIDTDR FFVVCLNNLG GCKGSTGPLS
VDPASGKPYG PDFPIVTVKD WVHAQYRLMQ YLGLSGWAAV IGGSLGGMQV LQWSITYPDA
VAHAVVIAAA PRLSAQNIAF NEVARQAIIT DPEFYGGRYA DHNALPRRGL MLARMLGHIT
YLSDDAMRAK FGRELRAGQV QYGFDVEFQV ESYLRYQGTS FVDRFDANTY LLMTKALDYF
DPAQASNDDL VAALAEVKAH FLVVSFTSDW RFSPERSREI VRALLASGKQ VSYAEIESNH
GHDAFLMTIP YYHRVLAGYM ANIDFASTPR GVSSPVYSTG GAV