Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_0005 |
Symbol | |
ID | 6742783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 4258 |
End bp | 5445 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642749785 |
Product | dihydropteroate synthase |
Protein accession | YP_002120675 |
Protein GI | 195952385 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes |
TIGRFAM ID | [TIGR01496] dihydropteroate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00021463 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAA CATTTTTAAA TACGTCAAAA GACGTCCATG TATATATAAA AAGAAATATC CATTACAGTA TTAAAACCGA AGACGTATCT AAAATAGTGC CCTCTTTTGC TATAAAGTTT GAGGGCTTTT TAGAGAAAAA CGTGCTAGAA AATGCTGCTC GTTTTGGTGT ATTTTGCCAC AATCAGGGGA AATATGCATT TGTTATAGGA AATAAGTTTG GTATTTTTGA GCTTTTAAAA GCCACAAAAC CAAAAATAGC TAAAGAACTT ATAAAGCATA TTATTAATTT TGGCAAAGAG CACCGTTACA ACATATCTTA CAACACAAAA GTTTTGCACC TTTCAGACAT GAAACCACTT GTAATGGGAG TTATAAATAT AACCAAAGAT TCCTTTTACG CACCTTCAAG GGTGGATGAG AAAGATATAC TTTTTAGAGT AGAAGAGTTT ATAAAAGAGG GTGCAGATAT AATAGATATA GGAGCTCAAT CTACTAGACC TGGAGCAGAG GAAATATCTT CTGAGGAAGA GGTTCGTAAG CTTTTAGAAC CCTTAAGAAA GATAAGAAAA GAGTTTAAGA ATGTGTGGAT ATCCATAGAT ACATATTTTT CCAATACTGC TAGGGTATGT CTTGAAGAAG GGGCAGATAT TATAAACGAT ATAAGTGGTG GAGTTTTTGA CGATGATATA CTAAAAACTA TAGCTGTTTA TAACTGCCCT TACATCATAG GACATTCCTC TTCGTACAAA CCAAATCAGT GGCCCCATAC AGATTTTGAG TATGAGGATA TAACCCTTGA AATTATAAAT CATTTTAAAA GCCAAGAGAC AAAGCTTTTA GAACTTGGGT ATAATCTCTT TAACGGCATA ATCATAGACC CATGTATAGG CTTTGCCAAA AAACCTATGC ACAACTTAGA AATTTTAAAT CAGATAGAAG CTCTTAGAAG TCTTGATAGG CCAATTCTTA TAGGTACTTC AAGAAAATCT TTTATAGGTA TAGTGATAAA AGAGTTTTTA CAAAAAGAAT CTATACCAGC CCCAGAGCAA AGGCTTGTAG GAAGTCTTGG GTCTATAGCC CAAAGTATTA CAAAAAACGC TTGCCATATT GTAAGAACCC ATGATGTAGC TACTACTAAA GAGTTTATAG CGCTTTTAGA TGCAATAAGG AATTATCATT ATGCTTGA
|
Protein sequence | MKITFLNTSK DVHVYIKRNI HYSIKTEDVS KIVPSFAIKF EGFLEKNVLE NAARFGVFCH NQGKYAFVIG NKFGIFELLK ATKPKIAKEL IKHIINFGKE HRYNISYNTK VLHLSDMKPL VMGVINITKD SFYAPSRVDE KDILFRVEEF IKEGADIIDI GAQSTRPGAE EISSEEEVRK LLEPLRKIRK EFKNVWISID TYFSNTARVC LEEGADIIND ISGGVFDDDI LKTIAVYNCP YIIGHSSSYK PNQWPHTDFE YEDITLEIIN HFKSQETKLL ELGYNLFNGI IIDPCIGFAK KPMHNLEILN QIEALRSLDR PILIGTSRKS FIGIVIKEFL QKESIPAPEQ RLVGSLGSIA QSITKNACHI VRTHDVATTK EFIALLDAIR NYHYA
|
| |