Gene Pars_1804 details


Gene Information

Locus tag: Pars_1804
Symbol:
ID: 5055879
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1619406
End bp: 1620791
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 53%
IMG OID: 640469349
Product: CoA-binding domain-containing protein
Protein accession: YP_001154007
Protein GI: 145592005
COG category: [C] Energy production and conversion
COG ID: [COG1042] Acyl-CoA synthetase (NDP forming)
TIGRFAM ID: [TIGR02717] acetyl coenzyme A synthetase (ADP forming), alpha domain
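
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1619406
END_BP = 1620791
GENE_LENGTH_BP = 1386
PROTEIN_LENGTH_AA = 461


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1386
print(expected_protein_length(GENE_LENGTH_BP))  # 461
```

Under these assumptions both derived values agree with the reported gene length (1386 bp) and protein length (461 aa).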


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.688194
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.0833486
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTTTCAA AACTTCTCGA CCCAGAAAGC GTAGCAGTAG CAGGGGCGTC TCCAAAACAA 
GGCTCTGTGG GCTACGTAAT TTTGGAGAAC CTAGTAAATA GATTTGGAAA AAAGGTATTC
CCAATAAATC CGAAATACGA CGAGGTGGAG CTATGGGGCC GCGTGATTAA GTTCTACAAA
TCAATCTCCG AGGTGCCGGA TCCCGTAGAC GTGGTCGTGA TAGCGACGCC GGCTCCCACC
GTCCCCAAGG TGCTTGAGGA GGCGGGTATT AAAGGCGTTA AAGCCGCGGT GATTGTCAGT
AGCGGCTTCT CTGAGGCAGG AAACACAGAA TTGGAGAACT GGGTTAAAGC CGTGGCGAAG
CAGTATGGTG TAAGAGTCTT AGGGCCGAAC TGTATCGGCG TGTATAACGC CTACTCCGGC
TTCGATACAG TTTTTCTCCC AGCAGAGAGA GCCGGCAGAC CGCCGCCTGG CCCTCTTGCT
TTGATTAGCC AGTCAGGTGC AGTTGCAGCG GCTATAATGG ATTGGGCGGC GAGGAGGAGG
CTGGGGTTAG GCTTTTTAGT TAACTACGGG AACAAGGCAG ATATTACTGA AACCGAGTTA
TTAGAGGGTT TTGCGGCAGA CGACCGGGTA AAAGTAATTA CGATATATAT CGAGGGGTTC
AAATACCCCG GCGAGGCAAG GCAGTTTTTA GAAACTGCAA AGAAGATAGC GGCTAAGAAG
CCTGTGGTGG CCTATAAGGC TGGTAGGGGC GGCGCCGCGC AGAGGGCTGT TAAGAGCCAC
ACCGCAGCAA TGGCGGGGTC GTACGAAATG TACCATGGCC TATTTCAACA AGCGGGCGTA
ATAGAGGCTT CGTCTGTCAG AGAGATGTTT GACATGGCTA AGGCCCTCGC CATGCAGCCC
ACCCCACGGG GAAGGCGGAC TCTCGTCCTT ACAGACAGCG GCGGTATGGG TATTCAAGCT
GTGGACGCGC TGGAGGCTCT GGGGCTTGAG GTGCCTGAGA TCCCGGAGAG CGTCGCCAAG
GAGTTGAAGA GGGAGCTACT GCCTTTTGCC GCTGTGACGA ACCCTGTCGA TGTCACGGGG
AGTACAACAG ATGAGCATTA CAAAATAGTC CTAGACGCGT TGCTGCCAAC TCCGCTTTTT
GACATGGCGC TTGTAGTCAC GTTGATGCAA GTCCCTGGCC TAACAAAAAA CTTAGCCGAC
TACCTAATAG ATGCGAAAAA ATACGGCAAG CCCATAGTCG TGGTGAACTT CGGCGGTAGC
GAACTCGTAC AGAGATTTGA AGAGGTGCTT GAGGACAACG GAATTCCGGT GTACCCCACT
CCCGACAGAG CGGCTAAGGC GCTTTGGGCC TTGTATAAAT ATGGGGAAAT AAGAAAGAGG
TTATGA
 
Protein sequence
MLSKLLDPES VAVAGASPKQ GSVGYVILEN LVNRFGKKVF PINPKYDEVE LWGRVIKFYK 
SISEVPDPVD VVVIATPAPT VPKVLEEAGI KGVKAAVIVS SGFSEAGNTE LENWVKAVAK
QYGVRVLGPN CIGVYNAYSG FDTVFLPAER AGRPPPGPLA LISQSGAVAA AIMDWAARRR
LGLGFLVNYG NKADITETEL LEGFAADDRV KVITIYIEGF KYPGEARQFL ETAKKIAAKK
PVVAYKAGRG GAAQRAVKSH TAAMAGSYEM YHGLFQQAGV IEASSVREMF DMAKALAMQP
TPRGRRTLVL TDSGGMGIQA VDALEALGLE VPEIPESVAK ELKRELLPFA AVTNPVDVTG
STTDEHYKIV LDALLPTPLF DMALVVTLMQ VPGLTKNLAD YLIDAKKYGK PIVVVNFGGS
ELVQRFEEVL EDNGIPVYPT PDRAAKALWA LYKYGEIRKR L
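
The sequences above are printed in blocks of ten characters, so the spacing has to be removed before they can be used in other tools. The sketch below shows one way to do that, along with a simple GC-fraction calculation; the helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are included to keep the example short. The gene begins with GTG, which is an alternative start codon in translation table 11 and is read as methionine when it initiates translation, consistent with the leading M of the protein sequence.

```python
def join_blocks(text: str) -> str:
    """Remove the block spacing and line breaks from a displayed sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last displayed lines of the gene sequence in this record.
first_line = "GTGCTTTCAA AACTTCTCGA CCCAGAAAGC GTAGCAGTAG CAGGGGCGTC TCCAAAACAA"
last_line = "TTATGA"

head = join_blocks(first_line)
tail = join_blocks(last_line)

print(len(head))    # 60 bases per full display line
print(head[:3])     # GTG, the start codon
print(tail[-3:])    # TGA, the stop codon
```

Passing the full gene sequence through `join_blocks` and then `gc_fraction` would give a value to compare against the reported GC content.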