Gene Information |
Locus tag | Pars_2200 |
Symbol | |
ID | 5055473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1971515 |
End bp | 1973155 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640469752 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001154398 |
Protein GI | 145592396 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
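The coordinate fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates (which the listed 1641 bp length implies) and a CDS that ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumption: the CDS is a whole number of codons and, when
    has_stop_codon is True, the final codon is a stop that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Pars_2200.
length_bp = gene_length(1971515, 1973155)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1641 546
```

Both results match the Gene Length and Protein Length rows of the record.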
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGAGGT ACAATCTCAC ATTAAATAAG ATTTGGCAGT ATGTAAAAGA GATAAACGGC GATGTAGAAG TTGCGCATCT CCCCCCGCAC GGCAATAACA TAAGATCGAC ATATGCCAGG GAATACGAGA GGACTCTCCG GCTGGCCGAC GGGCTTAGGC GCTTAGGCAT CGGGCCTGGG GACAAGGTAG CCACTATGGA CTGGAATACA ATATGGCACT TCGACCTCTA CTGGGCGGTC CCCGCGATGG GCGCCATACT ACACCCCCTA AACGTCCGTC TCGCTCCGGA GGACTTGGTG TACATAATCA ACCACGCCGG CGACAAGGCC TTGGTATACC ACAGGGACTT CGCCCCCCTC GTGGAGAAGA TTAGGCCGTA CCTCAAGACC GTCCAGATAT ACATACAGAT ATCAGACGGG GCCGGCGCGG TGGGCAAAGA CCCGGAAATA GAAGATGTGA TGAAAAGCGG AGAGCCAAGG CCCTTCCCCG ATCTCAGCGA GGACACCATC GCGACAATTG GATACACCAG CGGGACTACC GGCAAGCCGA AGGGCGCCTA CTTCACCCAC AGGGCGCTAA CGCTACACAC CCTGTCCAGC GCCTTGATGT TTTCAGTGGC TCGGGGTTTC GCGAGGCCTG AGTGCGCTGA GGAGGTGTGT ACCTTCCTAC AGCTGGTCCC CATGTTCCAC GTCCACGGCT GGGGCACGCC TTGGACCTTC GCCCTTATGG GGTGGAGGCA AGTGTACCCC GGCCGGTTTG ACCCCAACCA CGTGGTTAAG CTAATAGCGG AAGAGAGGGT GAAGAGCTTG GCAGGCGTGC CGACAATGCT CTATATGTTG CTCACGGCGC CCGAGTTTCC CAAGTACGTA AACAGGATTA GAGAGGTGAA GCCAATATTT GTCGTAGGCG GCGCAGCCCT CCCGAAGGAG CTGGCAAAAA GAGCGGCCGA GGCCGGGTTC ATCCCAAGAG TTGGCTACGG ACTCACGGAG ACAGCGCCGG TCCTGACGCT TGGGTATTTC AGACCCACGG AGAAGTTGCC TCAAGACGTC GAGGAGTACT ACAGCGTCCT AACAGCGACG GGTCTGCCCA TACCCCTTGT GGATCTCGCC GTGGTTGACG AGAATCTCAA CCCCGTCCCC CGCGACGGAA GGACTATGGG TGAAATAGTT GTAAAGGCGC CTTGGGTAAC GCCTGAATAC TTGGGAGACC CCGAGAAGAC CAAGGAGTCT TTCCGAGGGG GCTGGTTCAG AACTGGCGAC GTCGCTGTGT GGTATCCAGA CGGCCGCATC AGGATAGTGG ACAGGGCCAA AGACGTTATC AAATCCGGGG GCGAGTGGAT CTCCTCCCTG CAACTAGAGG ACTTAATCGC CACGCACCCC GCCGTCGCGC AAGTCGCAGT TATCGGAGTC CCGCACGAGA AATGGGGCGA GCGCCCAGTC GCCGTGGTGG TGCTCAAGCC GGGCGCCGCG GCCACGGAGC AAGACATAAT CAACCACTTG CAGAAATTCG TCGACGCGGG GAAGATCCCC AAGTGGTGGC TACCCGACAA GGTGATATTC GTCAACCAGC TACCGCTCAC CGGCACAGGG AAGATAGACA AGAAAGTACT CAAGGAGCAG TTCAGGAACA CGCTGAAATA G
|
Protein sequence | MERYNLTLNK IWQYVKEING DVEVAHLPPH GNNIRSTYAR EYERTLRLAD GLRRLGIGPG DKVATMDWNT IWHFDLYWAV PAMGAILHPL NVRLAPEDLV YIINHAGDKA LVYHRDFAPL VEKIRPYLKT VQIYIQISDG AGAVGKDPEI EDVMKSGEPR PFPDLSEDTI ATIGYTSGTT GKPKGAYFTH RALTLHTLSS ALMFSVARGF ARPECAEEVC TFLQLVPMFH VHGWGTPWTF ALMGWRQVYP GRFDPNHVVK LIAEERVKSL AGVPTMLYML LTAPEFPKYV NRIREVKPIF VVGGAALPKE LAKRAAEAGF IPRVGYGLTE TAPVLTLGYF RPTEKLPQDV EEYYSVLTAT GLPIPLVDLA VVDENLNPVP RDGRTMGEIV VKAPWVTPEY LGDPEKTKES FRGGWFRTGD VAVWYPDGRI RIVDRAKDVI KSGGEWISSL QLEDLIATHP AVAQVAVIGV PHEKWGERPV AVVVLKPGAA ATEQDIINHL QKFVDAGKIP KWWLPDKVIF VNQLPLTGTG KIDKKVLKEQ FRNTLK
|
| |
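The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any computation. The following is a minimal sketch of how the gene sequence might be cleaned, its GC fraction computed, and its reading frame translated. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, not in its amino acid assignments).

```python
from itertools import product

# Standard codon assignments, built in the conventional T, C, A, G order.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(zip(("".join(c) for c in product(_BASES, repeat=3)), _AMINO))


def clean_sequence(raw: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate frame 1 codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
fragment = clean_sequence("ATGGAGAGGT ACAATCTCAC ATTAAATAAG")
print(translate(fragment))  # MERYNLTLNK
```

The translated fragment matches the first block of the protein sequence listed in the record. Applying `gc_fraction` to the full cleaned gene sequence gives the quantity reported in the GC content row.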