Gene Information |
Locus tag | Pars_1386 |
Symbol | |
ID | 5055206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1249553 |
End bp | 1251517 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640468931 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001153600 |
Protein GI | 145591598 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases |
TIGRFAM ID | |
|
|
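The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and an unspliced CDS that ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1249553, 1251517)
print(length, protein_length(length))
```

With the values in this record, the sketch gives 1965 bp and 654 aa, which agrees with the Gene Length and Protein Length fields.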
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.383779 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATCTT CTCAAAAATT CAAAGCATGG GAGGGGCTAT ACAATAGGTG GGCAGAAGAC CCGGAGGGCT TTTGGCGCGA GTTTATAGAA AAGACCACGC ACCTTATCTA CTGGGCTAAA AAGCCCGAGA GAATTTTTCA GTGGCAACCC CCCGAGCCTT TCAAGTGGTT TGTTGGCGGC TACACAAATG CAGGTTACAG CGCGGTAGAT TACAAGACGG GGCTTCTAGG CGAGAAGATT GCCTACATCT ACCTAAACCC CGAGGCGGGG GCGGAGCGGA AGGTGACATA CGGCGAATTG GCGTCGTATG TCTACAAATT CAGCGCCGCC CTGAGAGCTG CCGGGGTGAA GAAGGGGGAC ACTATCCTCG TCTACATGCC TAACTCAATT GAGGCTGTTG CGGCTATACT GGCTGCGGCG CGTGTAGGAG CCGTCTCCAC CACCGTCTTT GCCGGATTTT CACCGAAGGC AGTGGCCGAT AGGATAGAGC TGGTAGAGCC CAAGATCGTA TTCACCCAAG ACTACTCGCT ACGCAGAGGA AGAAAAATCC CGCTTAAGGC AAATATCGAC GAGGCGTTTA AGATATCAGC GTGGCGGCCA TCCCTCGTGG TGGTAAAGAA GACGGAGGAG GGAGGAGATG TGCCGATGGA AAAGGGGCGG GATATCTGGC TTGAGGAGTT TCTCGAAATG GGGAAGGGCC ACTCGGCGCA TCCCGAGTTT GTAGAGTCCA ACGAGCCCCT CTTCGTCTTG CCCACCTCAG GCACCACGGC AAAGCCCAAG CCCGTGGTAC ACGTACATGG AGGCTACCAG GTATGGATCA TATACGGCGC TCTGCTTGTG TACGGCCTCT CTGCCAACGA TCTTATTTTC AACACAAGCG ACATCGGGTG GATCGTGGGA CAGAGCTATA TAGTTTTCGC GCCGCTGATT ATGGGCGCCA CCTCTATCCT ATTCGACGGC GCTATAGACT ACCCCAAGCC CGACCTATTC TGGGAGATCG TGGAGAAGTA CAAGCCGACG CTGATTTGGA CCTCCCCCAC GGCGGCGAGG CTTTTGATGA GGTACGGCAC GAACTTGGCC ATGAAACACG ACCTCTCATC AGTAACGCGG GTAGTCACGG CTGGTGAGGT TCTGAACCCA GAAGTGTGGC GCTGGCTGTA CGAGGACGTG TTCAGGAAGA GGGTGCCCGT AATAGACCAC TGGTGGCAGA CCGAGCTGGC AGGCCCCACA ATTGGGTACT ACTACGCCCT TGTAAGCGGC ATGCCCCACG GCCTTGAGCA CATGGAGATT AAGCCGGGCT CCGCCGGCGT CCCGCTACCG GGCGTCGAGG TGGAAGTAGT AGACGAGAGG GGCAACCCGG TGCCGCCTGG CCACAAGGGG ACGTTGGTGA TCAAAAGGCC GCATCCCGGC ATGACGCCGA CATTGTGGAG GGACCACCAG CGGTATTTAA ACGACTATTG GGGCAGATAC GAGGGGAAGT TGGTTTACTA CACGGGCGAC GCGGCTCACA TGGATGAAGA CGGCTACATC TGGTTCGCCG GGAGGGCCGA TGAAGTGATT AAAATCGCCG GTCACAGGAT AGGCACTATA GAGGTGGAGT CGGCCCTCGT TTCCCACCCA GCCGTCGCAG AGGCGGCTGT GGTGGGCGTC CCAGACCCGC TGAGGGGGGA GGCAATTGCC GCCTTCGTGG TGCTGAGGCC AGGCCGGCAA CCCACAGAGG ACCTCAAGAA GGATCTAATT GAACATGTGA GGAAGACCTT CGGCCCAATT GCGGTGTTCG CCGGGGTAGA GTTCGTCAAC 
ATGCTCCCCA AAACCCGTTC GGGGAAGATA ATGAGGAGGG TGCTCAAGAG GCTGTGGACC GGCGAGCCGC TAGGAGATCT CTCAACAATA GAAGACGAGG CATCGATAGA GGAGGTTAAG GAGGCTGTCT CTAAAATGAA GTTTATAAAA ACTGCCGAAT TTTAA
|
Protein sequence | MSSSQKFKAW EGLYNRWAED PEGFWREFIE KTTHLIYWAK KPERIFQWQP PEPFKWFVGG YTNAGYSAVD YKTGLLGEKI AYIYLNPEAG AERKVTYGEL ASYVYKFSAA LRAAGVKKGD TILVYMPNSI EAVAAILAAA RVGAVSTTVF AGFSPKAVAD RIELVEPKIV FTQDYSLRRG RKIPLKANID EAFKISAWRP SLVVVKKTEE GGDVPMEKGR DIWLEEFLEM GKGHSAHPEF VESNEPLFVL PTSGTTAKPK PVVHVHGGYQ VWIIYGALLV YGLSANDLIF NTSDIGWIVG QSYIVFAPLI MGATSILFDG AIDYPKPDLF WEIVEKYKPT LIWTSPTAAR LLMRYGTNLA MKHDLSSVTR VVTAGEVLNP EVWRWLYEDV FRKRVPVIDH WWQTELAGPT IGYYYALVSG MPHGLEHMEI KPGSAGVPLP GVEVEVVDER GNPVPPGHKG TLVIKRPHPG MTPTLWRDHQ RYLNDYWGRY EGKLVYYTGD AAHMDEDGYI WFAGRADEVI KIAGHRIGTI EVESALVSHP AVAEAAVVGV PDPLRGEAIA AFVVLRPGRQ PTEDLKKDLI EHVRKTFGPI AVFAGVEFVN MLPKTRSGKI MRRVLKRLWT GEPLGDLSTI EDEASIEEVK EAVSKMKFIK TAEF
|
| |
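The gene and protein sequences above can be cross-checked against each other. The sketch below is one way to do that, assuming the sequence is given 5' to 3' on the coding strand in blocks separated by whitespace, and that it contains only A, C, G and T. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon assignments are the standard ones, which translation table 11 shares with table 1 (the two tables differ in their permitted start codons).

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1; stop codons are written as '*'."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 nt and last 15 nt of the gene sequence in this record.
print(translate("ATGTCATCTT CTCAAAAATT CAAAGCATGG"))
print(translate("ACTGCCGAAT TTTAA"))
```

The two calls print `MSSSQKFKAW` and `TAEF*`, matching the first ten and last four residues of the protein sequence listed above, followed by the stop codon. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.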