Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1804 |
Symbol | |
ID | 5055879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1619406 |
End bp | 1620791 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640469349 |
Product | CoA-binding domain-containing protein |
Protein accession | YP_001154007 |
Protein GI | 145592005 |
COG category | [C] Energy production and conversion |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) |
TIGRFAM ID | [TIGR02717] acetyl coenzyme A synthetase (ADP forming), alpha domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.688194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0833486 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTTTCAA AACTTCTCGA CCCAGAAAGC GTAGCAGTAG CAGGGGCGTC TCCAAAACAA GGCTCTGTGG GCTACGTAAT TTTGGAGAAC CTAGTAAATA GATTTGGAAA AAAGGTATTC CCAATAAATC CGAAATACGA CGAGGTGGAG CTATGGGGCC GCGTGATTAA GTTCTACAAA TCAATCTCCG AGGTGCCGGA TCCCGTAGAC GTGGTCGTGA TAGCGACGCC GGCTCCCACC GTCCCCAAGG TGCTTGAGGA GGCGGGTATT AAAGGCGTTA AAGCCGCGGT GATTGTCAGT AGCGGCTTCT CTGAGGCAGG AAACACAGAA TTGGAGAACT GGGTTAAAGC CGTGGCGAAG CAGTATGGTG TAAGAGTCTT AGGGCCGAAC TGTATCGGCG TGTATAACGC CTACTCCGGC TTCGATACAG TTTTTCTCCC AGCAGAGAGA GCCGGCAGAC CGCCGCCTGG CCCTCTTGCT TTGATTAGCC AGTCAGGTGC AGTTGCAGCG GCTATAATGG ATTGGGCGGC GAGGAGGAGG CTGGGGTTAG GCTTTTTAGT TAACTACGGG AACAAGGCAG ATATTACTGA AACCGAGTTA TTAGAGGGTT TTGCGGCAGA CGACCGGGTA AAAGTAATTA CGATATATAT CGAGGGGTTC AAATACCCCG GCGAGGCAAG GCAGTTTTTA GAAACTGCAA AGAAGATAGC GGCTAAGAAG CCTGTGGTGG CCTATAAGGC TGGTAGGGGC GGCGCCGCGC AGAGGGCTGT TAAGAGCCAC ACCGCAGCAA TGGCGGGGTC GTACGAAATG TACCATGGCC TATTTCAACA AGCGGGCGTA ATAGAGGCTT CGTCTGTCAG AGAGATGTTT GACATGGCTA AGGCCCTCGC CATGCAGCCC ACCCCACGGG GAAGGCGGAC TCTCGTCCTT ACAGACAGCG GCGGTATGGG TATTCAAGCT GTGGACGCGC TGGAGGCTCT GGGGCTTGAG GTGCCTGAGA TCCCGGAGAG CGTCGCCAAG GAGTTGAAGA GGGAGCTACT GCCTTTTGCC GCTGTGACGA ACCCTGTCGA TGTCACGGGG AGTACAACAG ATGAGCATTA CAAAATAGTC CTAGACGCGT TGCTGCCAAC TCCGCTTTTT GACATGGCGC TTGTAGTCAC GTTGATGCAA GTCCCTGGCC TAACAAAAAA CTTAGCCGAC TACCTAATAG ATGCGAAAAA ATACGGCAAG CCCATAGTCG TGGTGAACTT CGGCGGTAGC GAACTCGTAC AGAGATTTGA AGAGGTGCTT GAGGACAACG GAATTCCGGT GTACCCCACT CCCGACAGAG CGGCTAAGGC GCTTTGGGCC TTGTATAAAT ATGGGGAAAT AAGAAAGAGG TTATGA
|
Protein sequence | MLSKLLDPES VAVAGASPKQ GSVGYVILEN LVNRFGKKVF PINPKYDEVE LWGRVIKFYK SISEVPDPVD VVVIATPAPT VPKVLEEAGI KGVKAAVIVS SGFSEAGNTE LENWVKAVAK QYGVRVLGPN CIGVYNAYSG FDTVFLPAER AGRPPPGPLA LISQSGAVAA AIMDWAARRR LGLGFLVNYG NKADITETEL LEGFAADDRV KVITIYIEGF KYPGEARQFL ETAKKIAAKK PVVAYKAGRG GAAQRAVKSH TAAMAGSYEM YHGLFQQAGV IEASSVREMF DMAKALAMQP TPRGRRTLVL TDSGGMGIQA VDALEALGLE VPEIPESVAK ELKRELLPFA AVTNPVDVTG STTDEHYKIV LDALLPTPLF DMALVVTLMQ VPGLTKNLAD YLIDAKKYGK PIVVVNFGGS ELVQRFEEVL EDNGIPVYPT PDRAAKALWA LYKYGEIRKR L
|
| |