Gene Pars_1386 details

Gene Information

Locus tag: Pars_1386
Symbol:
ID: 5055206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1249553
End bp: 1251517
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 56%
IMG OID: 640468931
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001153600
Protein GI: 145591598
COG category: [I] Lipid transport and metabolism
COG ID: [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases
TIGRFAM ID:
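
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1249553
end_bp = 1251517
gene_length_bp = 1965
protein_length_aa = 654

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                                     # True
print(remainder == 0 and expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the reported gene length (1965 bp) and protein length (654 aa) agree with the coordinates.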


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.383779
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATCTT CTCAAAAATT CAAAGCATGG GAGGGGCTAT ACAATAGGTG GGCAGAAGAC 
CCGGAGGGCT TTTGGCGCGA GTTTATAGAA AAGACCACGC ACCTTATCTA CTGGGCTAAA
AAGCCCGAGA GAATTTTTCA GTGGCAACCC CCCGAGCCTT TCAAGTGGTT TGTTGGCGGC
TACACAAATG CAGGTTACAG CGCGGTAGAT TACAAGACGG GGCTTCTAGG CGAGAAGATT
GCCTACATCT ACCTAAACCC CGAGGCGGGG GCGGAGCGGA AGGTGACATA CGGCGAATTG
GCGTCGTATG TCTACAAATT CAGCGCCGCC CTGAGAGCTG CCGGGGTGAA GAAGGGGGAC
ACTATCCTCG TCTACATGCC TAACTCAATT GAGGCTGTTG CGGCTATACT GGCTGCGGCG
CGTGTAGGAG CCGTCTCCAC CACCGTCTTT GCCGGATTTT CACCGAAGGC AGTGGCCGAT
AGGATAGAGC TGGTAGAGCC CAAGATCGTA TTCACCCAAG ACTACTCGCT ACGCAGAGGA
AGAAAAATCC CGCTTAAGGC AAATATCGAC GAGGCGTTTA AGATATCAGC GTGGCGGCCA
TCCCTCGTGG TGGTAAAGAA GACGGAGGAG GGAGGAGATG TGCCGATGGA AAAGGGGCGG
GATATCTGGC TTGAGGAGTT TCTCGAAATG GGGAAGGGCC ACTCGGCGCA TCCCGAGTTT
GTAGAGTCCA ACGAGCCCCT CTTCGTCTTG CCCACCTCAG GCACCACGGC AAAGCCCAAG
CCCGTGGTAC ACGTACATGG AGGCTACCAG GTATGGATCA TATACGGCGC TCTGCTTGTG
TACGGCCTCT CTGCCAACGA TCTTATTTTC AACACAAGCG ACATCGGGTG GATCGTGGGA
CAGAGCTATA TAGTTTTCGC GCCGCTGATT ATGGGCGCCA CCTCTATCCT ATTCGACGGC
GCTATAGACT ACCCCAAGCC CGACCTATTC TGGGAGATCG TGGAGAAGTA CAAGCCGACG
CTGATTTGGA CCTCCCCCAC GGCGGCGAGG CTTTTGATGA GGTACGGCAC GAACTTGGCC
ATGAAACACG ACCTCTCATC AGTAACGCGG GTAGTCACGG CTGGTGAGGT TCTGAACCCA
GAAGTGTGGC GCTGGCTGTA CGAGGACGTG TTCAGGAAGA GGGTGCCCGT AATAGACCAC
TGGTGGCAGA CCGAGCTGGC AGGCCCCACA ATTGGGTACT ACTACGCCCT TGTAAGCGGC
ATGCCCCACG GCCTTGAGCA CATGGAGATT AAGCCGGGCT CCGCCGGCGT CCCGCTACCG
GGCGTCGAGG TGGAAGTAGT AGACGAGAGG GGCAACCCGG TGCCGCCTGG CCACAAGGGG
ACGTTGGTGA TCAAAAGGCC GCATCCCGGC ATGACGCCGA CATTGTGGAG GGACCACCAG
CGGTATTTAA ACGACTATTG GGGCAGATAC GAGGGGAAGT TGGTTTACTA CACGGGCGAC
GCGGCTCACA TGGATGAAGA CGGCTACATC TGGTTCGCCG GGAGGGCCGA TGAAGTGATT
AAAATCGCCG GTCACAGGAT AGGCACTATA GAGGTGGAGT CGGCCCTCGT TTCCCACCCA
GCCGTCGCAG AGGCGGCTGT GGTGGGCGTC CCAGACCCGC TGAGGGGGGA GGCAATTGCC
GCCTTCGTGG TGCTGAGGCC AGGCCGGCAA CCCACAGAGG ACCTCAAGAA GGATCTAATT
GAACATGTGA GGAAGACCTT CGGCCCAATT GCGGTGTTCG CCGGGGTAGA GTTCGTCAAC
ATGCTCCCCA AAACCCGTTC GGGGAAGATA ATGAGGAGGG TGCTCAAGAG GCTGTGGACC
GGCGAGCCGC TAGGAGATCT CTCAACAATA GAAGACGAGG CATCGATAGA GGAGGTTAAG
GAGGCTGTCT CTAAAATGAA GTTTATAAAA ACTGCCGAAT TTTAA
 
Protein sequence
MSSSQKFKAW EGLYNRWAED PEGFWREFIE KTTHLIYWAK KPERIFQWQP PEPFKWFVGG 
YTNAGYSAVD YKTGLLGEKI AYIYLNPEAG AERKVTYGEL ASYVYKFSAA LRAAGVKKGD
TILVYMPNSI EAVAAILAAA RVGAVSTTVF AGFSPKAVAD RIELVEPKIV FTQDYSLRRG
RKIPLKANID EAFKISAWRP SLVVVKKTEE GGDVPMEKGR DIWLEEFLEM GKGHSAHPEF
VESNEPLFVL PTSGTTAKPK PVVHVHGGYQ VWIIYGALLV YGLSANDLIF NTSDIGWIVG
QSYIVFAPLI MGATSILFDG AIDYPKPDLF WEIVEKYKPT LIWTSPTAAR LLMRYGTNLA
MKHDLSSVTR VVTAGEVLNP EVWRWLYEDV FRKRVPVIDH WWQTELAGPT IGYYYALVSG
MPHGLEHMEI KPGSAGVPLP GVEVEVVDER GNPVPPGHKG TLVIKRPHPG MTPTLWRDHQ
RYLNDYWGRY EGKLVYYTGD AAHMDEDGYI WFAGRADEVI KIAGHRIGTI EVESALVSHP
AVAEAAVVGV PDPLRGEAIA AFVVLRPGRQ PTEDLKKDLI EHVRKTFGPI AVFAGVEFVN
MLPKTRSGKI MRRVLKRLWT GEPLGDLSTI EDEASIEEVK EAVSKMKFIK TAEF
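
The relationship between the gene sequence and the protein sequence above can be illustrated by translating a few codons by hand. This is a minimal sketch under stated assumptions: it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in which codons may act as start codons, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example, not part of any database tool.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence listed above.
first_bases = "ATGTCATCTT CTCAAAAATT CAAAGCATGG"
last_bases = "ACTGCCGAAT TTTAA"

print(translate(first_bases))  # MSSSQKFKAW
print(translate(last_bases))   # TAEF
```

The two results match the first ten and last four residues of the protein sequence, and the final codon (TAA) is a stop codon.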