Gene Information |
Locus tag | Hneap_1833 |
Symbol | |
ID | 8534991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 1966984 |
End bp | 1968081 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 646384214 |
Product | chorismate synthase |
Protein accession | YP_003263702 |
Protein GI | 261856419 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
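The Start bp, End bp, Gene Length, and Protein Length fields above can be checked against one another with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Hneap_1833.
length_bp = gene_length(1966984, 1968081)
print(length_bp, protein_length(length_bp))  # prints: 1098 365
```

Both results match the Gene Length (1098 bp) and Protein Length (365 aa) fields listed in the record.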
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.306795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGGCA ATACGTTTGG AAAACTGTTC ACGGTAACTA CCTTCGGCGA ATCGCATGGC CTTGCGCTGG GCGCGATTGT GGATGGTTGC CCGCCGGGCA TCGAGATCAG CGAGGCGGAT TTGCAGATCG ACCTTGATCG GCGCAAACCG GGCACCTCGC GCCACACCAC GCAACGGCGC GAAGCGGATG AGGTCAAGAT TCTATCGGGC GTGTTCGAAG GCAAAACCAC TGGCACACCG ATTGGCTTGG TTATCGAAAA TACCGACCAA CGCTCGAAAG ATTACGGCAA GATCGCCGAT CAGTTCCGCC CCGGCCACGC CGATTACACC TACCTGCAAA AATACGGCAT CCGTGACTAT CGCGGTGGCG GGCGCTCATC GGCGCGGGAA ACCGCCATGC GCGTGGCCGC TGGCGCCATT GCCCGCAAAG TGCTGCGTGA ATCGTTCGGT GTACACATTC AGGGGTATCT GTCGCAGATC GGCCCGATCA AAGCCGAGGG TTTTGATGCG GCTGTCATCG AAACCAACCC GTTTTTCTGG CCCGATGCGG CGCAAGTGCC TGCGCTGGAA GCATTCATGG ATGATCTGCG CAAAAGCGGC GATTCGGTTG GCGCCAAAGT TACTGTGATG GCCACAGGCT GCCCGCCGGG TTGGGGTGAG CCGGTGTTCG ATCGGCTCGA TGCCGAACTG GCCCATGCCT TGATGAGCAT CAATGCGGTC AAGGGCGTGG AAATCGGTTC GGGCTTTGAT TGCGTGGCCG CGCGGGGAAC CGAGTTCCGT GATGAAATCA CCCCCGATGG GTTTTTGAGT AACCACGCAG GCGGCATTCT CGGTGGCATT TCCAGTGGGC AGGACATCGT GGCCCATATC GCGCTCAAGC CCACCTCCAG CATCCGCTTG CCCGGCCAAA GCGTGGACGT AACCGGCGCG GCGGCAGAAG TGATTACCAC AGGTCGCCAC GATCCCTGCG TCGGCATTCG CGCTACACCA ATCGCCGAAG CCATGATGGC ACTTACCCTG CTCGATCACG CCCTGCGCCA TCGCGGCCAA TGCGGCGGTG TGAATAGCGG CTCGCCGGTG ATTCCGGCCA AGAAATAA
|
Protein sequence | MSGNTFGKLF TVTTFGESHG LALGAIVDGC PPGIEISEAD LQIDLDRRKP GTSRHTTQRR EADEVKILSG VFEGKTTGTP IGLVIENTDQ RSKDYGKIAD QFRPGHADYT YLQKYGIRDY RGGGRSSARE TAMRVAAGAI ARKVLRESFG VHIQGYLSQI GPIKAEGFDA AVIETNPFFW PDAAQVPALE AFMDDLRKSG DSVGAKVTVM ATGCPPGWGE PVFDRLDAEL AHALMSINAV KGVEIGSGFD CVAARGTEFR DEITPDGFLS NHAGGILGGI SSGQDIVAHI ALKPTSSIRL PGQSVDVTGA AAEVITTGRH DPCVGIRATP IAEAMMALTL LDHALRHRGQ CGGVNSGSPV IPAKK
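The relationship between the gene sequence and the protein sequence above can be illustrated by translating the first few codons. This is a minimal sketch using only the Python standard library, not the pipeline that produced the record. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the Strand field is "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which start codons are permitted. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table built from the standard base ordering T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence, as written in the record.
prefix = "ATGTCCGGCA ATACGTTTGG AAAACTGTTC"
print(translate(prefix))  # prints: MSGNTFGKLF
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.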
|
| |