Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1827 |
Symbol | |
ID | 5056177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1637906 |
End bp | 1638934 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640469373 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001154030 |
Protein GI | 145592028 |
COG category | [C] Energy production and conversion |
COG ID | [COG0371] Glycerol dehydrogenase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0648145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.20433 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGCAAC TTGAGAGTTT TGAGATCCCG AGAACAGTCA TCTTTGGGCC AGGCGCAATT TCGAAAACCC CTCAAGTAGT TGCCAAGCAC AAGGCGGAGA GAATCCTAAT AATATCAGGT AAATCTGTTA CTGCCAACTA CGCCAATGAG GTCGCACATT TGCTATCAGG TTACAGCGTA GACGTGGTAA GATACGACGA GGTAGATACA AGCTATTCGA AATACGACTT AGTGTTGGGC GTCGGGGGCG GGAGGCCTAT TGACGTGGCC AAAGTGTACT CATATCTGCA TAGGGCTCCT CTAATAGTTA TCCCCACTTC GGCCAGCCAC GACGGAATTG CCTCGCCATA CGTGTCGTAT GCCCTATCCC AGAAAATGGC CTCGCATGGG AAAATAGTGG CATCTCCCAT AGCGATAATA GCTGACACCA CCGTAATCCT CAACGCGCCT TCTCGGTTGT TGAAAGCAGG AATAGGAGAC CTCCTTGGAA AAATAGTTGC TGTACGTGAT TGGCAACTTG CCCATAGGCT AAAAGGCGAG GAGTACAGCG AATACGCCGC CCACCTGGCG CTCACCAGCT ATAGAATAGT GGTTTCTAAC GCTTTCAGAA TCAAGAACTT TACTAAGGAG GAAGATGTGA GAGTTTTAGT AAAGGCCCTT ATAGGATGCG GCGTAGCTAT GGGCATTGCA GGTTCATCGC GGCCGTGTAG TGGCTCTGAA CACCTCTTTG CCCACGCCGT CGAGTTACTG CTAGGGGAGA AGAACAACGA GGCCATACAC GGCGAGTTAG TAGCCCTAGG CACTGTGGTA ATGGCCTACC TACATGGCAT GAACTGGCGC CGGATAAAAA GAGTAGCAAA AGAGGTGGGG CTTCCAACTA CTTTGAAACA GATAGGTATA GACGCAGATG TGGCTATAGA GGCCTTAACA ACAGCACACA CCCTCCGCCC AGATCGCTAC ACAATTTTAG GGAGTGGACT AGGGAAAGAG GCAGCCAGAC GCGCCTTGGA AACTACAGAA TTAATATAA
|
Protein sequence | MKQLESFEIP RTVIFGPGAI SKTPQVVAKH KAERILIISG KSVTANYANE VAHLLSGYSV DVVRYDEVDT SYSKYDLVLG VGGGRPIDVA KVYSYLHRAP LIVIPTSASH DGIASPYVSY ALSQKMASHG KIVASPIAII ADTTVILNAP SRLLKAGIGD LLGKIVAVRD WQLAHRLKGE EYSEYAAHLA LTSYRIVVSN AFRIKNFTKE EDVRVLVKAL IGCGVAMGIA GSSRPCSGSE HLFAHAVELL LGEKNNEAIH GELVALGTVV MAYLHGMNWR RIKRVAKEVG LPTTLKQIGI DADVAIEALT TAHTLRPDRY TILGSGLGKE AARRALETTE LI
|
| |