Gene Information |
Locus tag | Hneap_0998 |
Symbol | |
ID | 8534145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 1075470 |
End bp | 1078286 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 646383382 |
Product | Phosphoenolpyruvate carboxylase |
Protein accession | YP_003262881 |
Protein GI | 261855598 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA ATTCGAAAGA CAACGATAAA GCGCTGCGCG CACGGGTGCG CCTTTTCGGC AACCTTTTGG GCGAAGTGCT GAAAGAACAA ACCGGCGATC ACGTCTTCGA TACCGTCGAA ACGTTACGTC GTGGTTTTAT CAAATTGCGG CTGAAGCATA ATCCTAAATT GCACGCCAAA TTGATGGTTC TGTTGCGCAC TCTGGATCCG GATACGCTCA ATTTTGTCGT GCGCGCGTAC AACCTGTACT TCAGCCTGGT GAATATCGCT GAAGAAGATT TCATGCATCA GCATCGGCGC AAACAGGTTC GACTCGGCCT GCGTTTATGG CGCGGCTCGT TCTACGACAC CATGCGCGAG TTTTCCAAGC AGGGCATGAG CCCGGATGAT TTGCAGACGC TGCTCAACCG ACTCATTTAT ATGCCCGTAT TTACGGCGCA TCCCACCGAG GCAAAACGTC GAACCGTGAT GGATCTGCAG CGCAAAATCT TCTTGCTTTG CGCCGAGTTA GATCATCCGG AAGCAAAAGG GATTGAACGG GATCGTTTGC ATCAACAGGT TAAAAGCGTG ATTCTTTCAT TGCTTAAAAC CAATGAAGTG CGCACCACCC GGCCTGAAGT CCACGATGAG ATTCGTTTGG GCCTGTATTA TTTCAGCACG TCGATCTTTG ATGCCGTTCC GCTGGCTTAC CGCTACCTTG AGCGCGCGGT GGATGTGAAT TTCAACGAAA AATTCCCGGA TGCACCGGTC ACGGTGCCGA GTCTGTTCCG GTTCGGTTCT TGGATTGGTG GCGATCGCGA TGGCAATCCG TATGTAACGC ATGAAGTGAC CACTTTCGCG GTATGTTCGG CAACGCAGAC GATCTTGCAA GAATATCTTG ACCGATTGGC GGGTATGGAC CGCGTGCTCA CCCATTCGTG CCGTTTGTGC CCTGAGATTG ACCTAGAAGG ATTAGGTCTG CATCAGGATG CGGTGGAACT GGGGCTCGCA TCACCAGAAA ATTCAGGTGA TAGTTTCTTT ACGGAAGAGC CCTATCGTCT CAAGTTGCGC ATCATCGGCC AGCGCCTGAA GCACAACCTG GACTACGTGA CGGCCCTGCT GGACAAAAAG CCCATCAACT TGTCCGCTCA TGCCTACGCG CACAAGGATC AGTTCCTGAG CGACCTGTAC CGGATTCGTG ACACGTTGAT TTCCACAGGC GATCAGTTGC TTGCCGATGG CGAGATCAAG GATTTGATCC GCCTGGCGGA GACTTTCGGT TGGCATCTGT TCAAATTGGA TATCCGTCAG GAATCCACAC GCCATACTCA AACCGTGGCT GATATTCTCA AGCAATTGCA GCCGAAGACC GATTACATGG CATTGGACGA AGCCGGTCGT ATGACGCTGC TGACCCAGCT CATCAACAAG AGCCGCCACA AGGCCATCGA CATGGCAGCG TTATCTGCTG AATCTGCCGA AACCCTCGAA GTGTTCAAGG TCAAGCGTGA ACTGATCGAT ACCATCAGCC GCGAATGCTT CGGTACCTAC GTCATTTCAA TGACGCACGC AGCCAGCCAC GTGATGGAAG TCATGTTTCT GGCCGTATTG GCTGGATTGG CGGGCAAGCG TAAATCCCAG TGGTTCTGCG ATATTCAGAT TTCGCCGTTG TTTGAAACCA TCGAAGACTT ACATCAAATC GAGAACGTAT TAAGCGTATT ATTCGAAAAT CCGGTCTACC GTGAATTGAT TCGTGTTTCC GGCGATCTGC AAGAAGCGAT GCTGGGTTAT TCCGATTCCT GCAAGGATGG CGGTTCGCTC 
GCTTCGGTCT GGAGCCTGTA CAACGCTCAG AAACGTGTTC TGTCGATTAC CCAGAAGAAT GGTATCGAAT GCCGACTGTT CCATGGCCGT GGCGGCACAG TGGCGCGGGG CGGCGGGCCT ACTCACGAGT CGATTCTGTC GCTGCCGGCG GGTACCGTCG AAGGTCAGAT CAAATTTACC GAACAGGGTG AGGTACTCTC ATCCAAGTAC AGTAATACCG AAACGGCTAT TTACGAGATC ACCATGGGTG CGACCGGCCT GATGAAGGCA TCGGCGCATC TGGTGATGGA AAACAATCCG GCACCGGCCG AGCACGAGCA AGTCGTGGCC GAGTTGGCGA CCTACGGCGA GAAGGCATAC CGAGAATTGA CTGACGAAAC GCCATTCTTC TTCAATTACT TTTTCGAAGC GACACCGGTG CGAGAGCTTG GCTTGCTCAA CATCGGCTCG CGACCGGCAT CGCGCAAGGT CGGTGATTTG TCCAAGGCAT CTGTCCGCGC CATTCCCTGG GTATTCGGCT GGTCGCAGTC ACGACATACC TTGCCTGCAT GGTACGGAAT TGGCTCGGCG CTGAGAGCCT GGCGTCAAAG CCATAAGGAT CAGCCCGAGT TGCTGCACAC TCTGTTCAAC GAGTGGCCGT TCTTCCACAG TATGTTGCGC AATACCCAGC TTTCGCTGAC CAAGGGTGAA ATGACCATCG CGCGTGAGTA CGCCAGTCTG GTGGCCGATC AGGCTCAGGC GCTACCGGTA TATGACAAGA TCAGCACCGA GTACTACCGT ACGCTCGATG AATTGTTGCG GGTTGCTCGG GTCGACTCGC TGGTTGAAAT TGATGAATAC ATCGGCACCT CGATGATGCG ACGTAACCCT TATCTGGACG TGCTCAACCA TATCCAGATC GTATTGCTCC GCCGCTACCG GGATGATTCC GAGCCGGAAG CCGAACGGCA GAAGTGGTTG CTGCCGCTTT TACGCTCGAT CAACGCCATT GCCTCAGGGA TGCGTAATAC CGGCTAA
|
Protein sequence | MKKNSKDNDK ALRARVRLFG NLLGEVLKEQ TGDHVFDTVE TLRRGFIKLR LKHNPKLHAK LMVLLRTLDP DTLNFVVRAY NLYFSLVNIA EEDFMHQHRR KQVRLGLRLW RGSFYDTMRE FSKQGMSPDD LQTLLNRLIY MPVFTAHPTE AKRRTVMDLQ RKIFLLCAEL DHPEAKGIER DRLHQQVKSV ILSLLKTNEV RTTRPEVHDE IRLGLYYFST SIFDAVPLAY RYLERAVDVN FNEKFPDAPV TVPSLFRFGS WIGGDRDGNP YVTHEVTTFA VCSATQTILQ EYLDRLAGMD RVLTHSCRLC PEIDLEGLGL HQDAVELGLA SPENSGDSFF TEEPYRLKLR IIGQRLKHNL DYVTALLDKK PINLSAHAYA HKDQFLSDLY RIRDTLISTG DQLLADGEIK DLIRLAETFG WHLFKLDIRQ ESTRHTQTVA DILKQLQPKT DYMALDEAGR MTLLTQLINK SRHKAIDMAA LSAESAETLE VFKVKRELID TISRECFGTY VISMTHAASH VMEVMFLAVL AGLAGKRKSQ WFCDIQISPL FETIEDLHQI ENVLSVLFEN PVYRELIRVS GDLQEAMLGY SDSCKDGGSL ASVWSLYNAQ KRVLSITQKN GIECRLFHGR GGTVARGGGP THESILSLPA GTVEGQIKFT EQGEVLSSKY SNTETAIYEI TMGATGLMKA SAHLVMENNP APAEHEQVVA ELATYGEKAY RELTDETPFF FNYFFEATPV RELGLLNIGS RPASRKVGDL SKASVRAIPW VFGWSQSRHT LPAWYGIGSA LRAWRQSHKD QPELLHTLFN EWPFFHSMLR NTQLSLTKGE MTIAREYASL VADQAQALPV YDKISTEYYR TLDELLRVAR VDSLVEIDEY IGTSMMRRNP YLDVLNHIQI VLLRRYRDDS EPEAERQKWL LPLLRSINAI ASGMRNTG
|
| |
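The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the reported gene length) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Hneap_0998).
length = gene_length_bp(1075470, 1078286)
print(length)                     # 2817
print(protein_length_aa(length))  # 938
```

Both results agree with the Gene Length (2817 bp) and Protein Length (938 aa) fields listed for Hneap_0998.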