Gene Hneap_1336 details


Gene Information

Locus tag: Hneap_1336
Symbol:
ID: 8534492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 1446744
End bp: 1447985
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 58%
IMG OID: 646383727
Product: chorismate mutase
Protein accession: YP_003263217
Protein GI: 261855934
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0077] Prephenate dehydratase
TIGRFAM ID: [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2
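
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1446744
end_bp = 1447985
gene_length_bp = 1242
protein_length_aa = 413

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codons = span // 3
expected_residues = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("residues match protein length:", expected_residues == protein_length_aa)
```

Under these assumptions, all three checks agree with the values listed in the record.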


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACAGACG CTTTGGGTGA AGCGGGGGCT GGTTGGTCGG ATCGACGTTC GCCAGATCAG 
CCTCTGCCCA ACACGCTGGC TGAAGTGCGG CAACAGATCG ACTCACTCGA CTCCGAGTTG
ATTGCATTGA TTTCCAAACG TGCCCGCCTC GCCGAGCGCG TCGCGGAGAT CAAGCAGTTG
TCCAACGAAC CTCAGACATC TTTTTATCGT CCCGAACGAG AGGCTCAGGT ACTGTCACGT
GTGCGTGAAC GTAATCCTGG CCCTCTTTCG GGTGATACGA TGGTTTGGCT TTTTCGGGAA
ATTATGTCGG CTTGCCTCGC GCTTGAGCAG CCGCTTTCCG TAGCCTGCCT CGGTCCTTCG
GGCACCTTTT CCGAACAGGC GGCCTTGCGC GCGTTTGGTC ACGGCGCCCA TCTGGTTCTT
GAGCCGGGTA TTCCCGAAGT TTTCCGTGCC GTTGCGGCGG GTTCCGTTGA TTTCGGCGTT
GTACCCGTTG AAAACTCCAC GGAAGGCAGC GTGTCCCAAA CCTTGGATGC CTTGGCATTT
GGAGCGACTG GTGGCGCCTT GTATGGTGTT TGGGTGCCCG GGGAAGTCCG CATTTGCGGT
GAACTGTCGC TCAAGATAGA TCAGCAGTTG ATGGCTCGGC AGGATGCGCG CGATGTCTTG
CCCCAGCGTA TTGTGTCCCA TGCCCAGTCA TTGGCCCAAT GTCGGGAATG GCTGGACGTT
CACTATCCCG GCGTCGAACG CATTGCGGTG CAGAGCAATA GCGAAGCTGC TCGCCTTGCC
GCTGAATCAC CAAGCATCAT GGCCATTGGT CCCACGCTGG CTGCAGAGCA GCATGGTCTG
GATATAGTCG CTGCAAACAT TCAGGACAGT GCGTTCAATA CCACCCGATT CGTCGTGATC
GGTCGGGACA CAGTACCGCC CTCGGGTGCG GACAAAACCT CACTGGTGCT TTCCGTCAAC
AACATGCCCG GAGCGTTGTC GCGTTTGCTT GCACCGCTGG CGGAGGCGGG GATTGATGTT
ATGCGCATCG AATCCCGACC AGCGCGGGAA CGGGCGTGGG AGTATGTGTT TTTCATCGAT
TTCGAAGGAC ATGCAGACGA TGAGCGCATT CGGGCGGCCC TGAGCAAAAT GCAACCGTTT
TGCAGTTCTC TTCGTGTTTT GGGTTCCTAT CCACGCGCGG TCATGTCTGC ATCAAGTCCC
AATGCAGCGG GTGGATCAAC ATCGGGCAGG GTTGTGTCAT GA
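
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation such as the GC content reported in the record. The following is a minimal sketch, not the method the database used; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as printed (blocks of 10 bases).
first_line = "GTGACAGACG CTTTGGGTGA AGCGGGGGCT GGTTGGTCGG ATCGACGTTC GCCAGATCAG"
seq = clean_sequence(first_line)
print(len(seq), "bases; GC content of this line:", round(gc_percent(seq), 1), "%")
```

Passing the full 1242 bp sequence through the same two functions gives a whole-gene figure to compare with the GC content field.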
 
Protein sequence
MTDALGEAGA GWSDRRSPDQ PLPNTLAEVR QQIDSLDSEL IALISKRARL AERVAEIKQL 
SNEPQTSFYR PEREAQVLSR VRERNPGPLS GDTMVWLFRE IMSACLALEQ PLSVACLGPS
GTFSEQAALR AFGHGAHLVL EPGIPEVFRA VAAGSVDFGV VPVENSTEGS VSQTLDALAF
GATGGALYGV WVPGEVRICG ELSLKIDQQL MARQDARDVL PQRIVSHAQS LAQCREWLDV
HYPGVERIAV QSNSEAARLA AESPSIMAIG PTLAAEQHGL DIVAANIQDS AFNTTRFVVI
GRDTVPPSGA DKTSLVLSVN NMPGALSRLL APLAEAGIDV MRIESRPARE RAWEYVFFID
FEGHADDERI RAALSKMQPF CSSLRVLGSY PRAVMSASSP NAAGGSTSGR VVS
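
The protein sequence begins with M although the gene begins with GTG. Under translation table 11, GTG can act as a start codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene. It is an assumption-laden illustration, not the database's pipeline: `translate_cds` is a hypothetical helper, and the set of start codons it recognizes is deliberately limited to three common ones.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses these same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these three start codons are handled in this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading a start codon in first position as M."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence as printed in the record.
print(translate_cds("GTGACAGACG CTTTGGGTGA AGCGGGGGCT"))  # MTDALGEAGA
```

The result matches the first block of the protein sequence, MTDALGEAGA.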