Gene Hneap_1833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1833 
Symbol 
ID8534991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1966984 
End bp1968081 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content60% 
IMG OID646384214 
Productchorismate synthase 
Protein accessionYP_003263702 
Protein GI261856419 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.306795 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGGCA ATACGTTTGG AAAACTGTTC ACGGTAACTA CCTTCGGCGA ATCGCATGGC 
CTTGCGCTGG GCGCGATTGT GGATGGTTGC CCGCCGGGCA TCGAGATCAG CGAGGCGGAT
TTGCAGATCG ACCTTGATCG GCGCAAACCG GGCACCTCGC GCCACACCAC GCAACGGCGC
GAAGCGGATG AGGTCAAGAT TCTATCGGGC GTGTTCGAAG GCAAAACCAC TGGCACACCG
ATTGGCTTGG TTATCGAAAA TACCGACCAA CGCTCGAAAG ATTACGGCAA GATCGCCGAT
CAGTTCCGCC CCGGCCACGC CGATTACACC TACCTGCAAA AATACGGCAT CCGTGACTAT
CGCGGTGGCG GGCGCTCATC GGCGCGGGAA ACCGCCATGC GCGTGGCCGC TGGCGCCATT
GCCCGCAAAG TGCTGCGTGA ATCGTTCGGT GTACACATTC AGGGGTATCT GTCGCAGATC
GGCCCGATCA AAGCCGAGGG TTTTGATGCG GCTGTCATCG AAACCAACCC GTTTTTCTGG
CCCGATGCGG CGCAAGTGCC TGCGCTGGAA GCATTCATGG ATGATCTGCG CAAAAGCGGC
GATTCGGTTG GCGCCAAAGT TACTGTGATG GCCACAGGCT GCCCGCCGGG TTGGGGTGAG
CCGGTGTTCG ATCGGCTCGA TGCCGAACTG GCCCATGCCT TGATGAGCAT CAATGCGGTC
AAGGGCGTGG AAATCGGTTC GGGCTTTGAT TGCGTGGCCG CGCGGGGAAC CGAGTTCCGT
GATGAAATCA CCCCCGATGG GTTTTTGAGT AACCACGCAG GCGGCATTCT CGGTGGCATT
TCCAGTGGGC AGGACATCGT GGCCCATATC GCGCTCAAGC CCACCTCCAG CATCCGCTTG
CCCGGCCAAA GCGTGGACGT AACCGGCGCG GCGGCAGAAG TGATTACCAC AGGTCGCCAC
GATCCCTGCG TCGGCATTCG CGCTACACCA ATCGCCGAAG CCATGATGGC ACTTACCCTG
CTCGATCACG CCCTGCGCCA TCGCGGCCAA TGCGGCGGTG TGAATAGCGG CTCGCCGGTG
ATTCCGGCCA AGAAATAA
 
Protein sequence
MSGNTFGKLF TVTTFGESHG LALGAIVDGC PPGIEISEAD LQIDLDRRKP GTSRHTTQRR 
EADEVKILSG VFEGKTTGTP IGLVIENTDQ RSKDYGKIAD QFRPGHADYT YLQKYGIRDY
RGGGRSSARE TAMRVAAGAI ARKVLRESFG VHIQGYLSQI GPIKAEGFDA AVIETNPFFW
PDAAQVPALE AFMDDLRKSG DSVGAKVTVM ATGCPPGWGE PVFDRLDAEL AHALMSINAV
KGVEIGSGFD CVAARGTEFR DEITPDGFLS NHAGGILGGI SSGQDIVAHI ALKPTSSIRL
PGQSVDVTGA AAEVITTGRH DPCVGIRATP IAEAMMALTL LDHALRHRGQ CGGVNSGSPV
IPAKK