Gene Pars_2200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2200 
Symbol 
ID5055473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1971515 
End bp1973155 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content58% 
IMG OID640469752 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001154398 
Protein GI145592396 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGAGGT ACAATCTCAC ATTAAATAAG ATTTGGCAGT ATGTAAAAGA GATAAACGGC 
GATGTAGAAG TTGCGCATCT CCCCCCGCAC GGCAATAACA TAAGATCGAC ATATGCCAGG
GAATACGAGA GGACTCTCCG GCTGGCCGAC GGGCTTAGGC GCTTAGGCAT CGGGCCTGGG
GACAAGGTAG CCACTATGGA CTGGAATACA ATATGGCACT TCGACCTCTA CTGGGCGGTC
CCCGCGATGG GCGCCATACT ACACCCCCTA AACGTCCGTC TCGCTCCGGA GGACTTGGTG
TACATAATCA ACCACGCCGG CGACAAGGCC TTGGTATACC ACAGGGACTT CGCCCCCCTC
GTGGAGAAGA TTAGGCCGTA CCTCAAGACC GTCCAGATAT ACATACAGAT ATCAGACGGG
GCCGGCGCGG TGGGCAAAGA CCCGGAAATA GAAGATGTGA TGAAAAGCGG AGAGCCAAGG
CCCTTCCCCG ATCTCAGCGA GGACACCATC GCGACAATTG GATACACCAG CGGGACTACC
GGCAAGCCGA AGGGCGCCTA CTTCACCCAC AGGGCGCTAA CGCTACACAC CCTGTCCAGC
GCCTTGATGT TTTCAGTGGC TCGGGGTTTC GCGAGGCCTG AGTGCGCTGA GGAGGTGTGT
ACCTTCCTAC AGCTGGTCCC CATGTTCCAC GTCCACGGCT GGGGCACGCC TTGGACCTTC
GCCCTTATGG GGTGGAGGCA AGTGTACCCC GGCCGGTTTG ACCCCAACCA CGTGGTTAAG
CTAATAGCGG AAGAGAGGGT GAAGAGCTTG GCAGGCGTGC CGACAATGCT CTATATGTTG
CTCACGGCGC CCGAGTTTCC CAAGTACGTA AACAGGATTA GAGAGGTGAA GCCAATATTT
GTCGTAGGCG GCGCAGCCCT CCCGAAGGAG CTGGCAAAAA GAGCGGCCGA GGCCGGGTTC
ATCCCAAGAG TTGGCTACGG ACTCACGGAG ACAGCGCCGG TCCTGACGCT TGGGTATTTC
AGACCCACGG AGAAGTTGCC TCAAGACGTC GAGGAGTACT ACAGCGTCCT AACAGCGACG
GGTCTGCCCA TACCCCTTGT GGATCTCGCC GTGGTTGACG AGAATCTCAA CCCCGTCCCC
CGCGACGGAA GGACTATGGG TGAAATAGTT GTAAAGGCGC CTTGGGTAAC GCCTGAATAC
TTGGGAGACC CCGAGAAGAC CAAGGAGTCT TTCCGAGGGG GCTGGTTCAG AACTGGCGAC
GTCGCTGTGT GGTATCCAGA CGGCCGCATC AGGATAGTGG ACAGGGCCAA AGACGTTATC
AAATCCGGGG GCGAGTGGAT CTCCTCCCTG CAACTAGAGG ACTTAATCGC CACGCACCCC
GCCGTCGCGC AAGTCGCAGT TATCGGAGTC CCGCACGAGA AATGGGGCGA GCGCCCAGTC
GCCGTGGTGG TGCTCAAGCC GGGCGCCGCG GCCACGGAGC AAGACATAAT CAACCACTTG
CAGAAATTCG TCGACGCGGG GAAGATCCCC AAGTGGTGGC TACCCGACAA GGTGATATTC
GTCAACCAGC TACCGCTCAC CGGCACAGGG AAGATAGACA AGAAAGTACT CAAGGAGCAG
TTCAGGAACA CGCTGAAATA G
 
Protein sequence
MERYNLTLNK IWQYVKEING DVEVAHLPPH GNNIRSTYAR EYERTLRLAD GLRRLGIGPG 
DKVATMDWNT IWHFDLYWAV PAMGAILHPL NVRLAPEDLV YIINHAGDKA LVYHRDFAPL
VEKIRPYLKT VQIYIQISDG AGAVGKDPEI EDVMKSGEPR PFPDLSEDTI ATIGYTSGTT
GKPKGAYFTH RALTLHTLSS ALMFSVARGF ARPECAEEVC TFLQLVPMFH VHGWGTPWTF
ALMGWRQVYP GRFDPNHVVK LIAEERVKSL AGVPTMLYML LTAPEFPKYV NRIREVKPIF
VVGGAALPKE LAKRAAEAGF IPRVGYGLTE TAPVLTLGYF RPTEKLPQDV EEYYSVLTAT
GLPIPLVDLA VVDENLNPVP RDGRTMGEIV VKAPWVTPEY LGDPEKTKES FRGGWFRTGD
VAVWYPDGRI RIVDRAKDVI KSGGEWISSL QLEDLIATHP AVAQVAVIGV PHEKWGERPV
AVVVLKPGAA ATEQDIINHL QKFVDAGKIP KWWLPDKVIF VNQLPLTGTG KIDKKVLKEQ
FRNTLK