Gene Pars_0642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0642 
Symbol 
ID5054396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp571130 
End bp572317 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content49% 
IMG OID640468201 
ProductL-carnitine dehydratase/bile acid-inducible protein F 
Protein accessionYP_001152885 
Protein GI145590883 
COG category[C] Energy production and conversion 
COG ID[COG1804] Predicted acyl-CoA transferases/carnitine dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.440176 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATACCT ACCCTCTCAG AGGTGTAAGA ATTCTAGATC TCACAGCGGC CATGGCTGGT 
CCTTTTGCAA CTATGCTACT TGCTGACTTG GGCGCAGATG TGATTAAGAT AGAGCCGCCT
GAAGGGGACC ACGCTAGGGA CTGGGGGCCG CCTTCATATG GTGAGAAGTA CAGCGCGTAC
TTTGCAAGCG TTAACAGAGG CAAAAAGTCC ATTGTGCTGG ATTTGAAGAA GGCAGAGGCA
AGAGAAGTCT TCTACCGGCT AGTCAAGACG GCCAGCGCTG TTGTTGAGAA CTTCCGACCT
GGCGTGGCTC AGAAACTAGG CGTAGACTAT CATGCAGTTA AGCAACACAA CCCCAATATT
GTATACTGCT CCATTTCGGG ATTTGGCGAG GGGCCTTATA GAGATCTCCC CGCATATGAT
CTAGTAGCGC TGGCAATGTC AGGTCTTATG GATTTGACTG GTGAGCCGGA GGGCCCCCCG
GTGAAATTCG CAGTGCCTAT TACCGACATA GCTGCGGGGT TCTACTGCGC CTTATCAATA
ATAACGTCTG TTTTAACAAA TCGCCCAGGA TATATCGAAA TCCCGCTCAT TGAGGCGGCT
ATCTCGTTGT TAACCCACCA GGCGGGTTAT TACTTCGCGA GCGGGGTGCC GCCTAGGCGT
ATGGGTAGCG CACACCCGAC AATAGTCCCA TACCAAGCTT TTAGGGCCAA AGACGGCTAC
TTCGTATTGG CAGTTGGTAG TGACCATTTA TGGAAAAAGT TCTGCGAGGC TATTGGAAGG
CCGGAGTTAG CCGACGACCC CCGATTTAAT ACTAACACAA AGAGAGTACA AAATAGGGAG
GAGTTAGTAA AGATGCTGGA AGAGCTATTT CTAGAGAAAG AGGTGAATTA TTGGGTATCG
CTAATGTGGC AAAATGGAAT TCCTGCGGCC CCTGTGTACA ACTTGACACA GGTCTTCTCA
GATCCTCATG TGCGATACAG AAAAATTGTA GTTGAAAGCC AGGGCCCATT TGGCGTGATC
AAGACATTAA AATCGCCCAT AAATGCTGAA TCGATAAAAG TAGGAAACTA CACGCCGCCC
CCCTTACTAG GTCAACACAC TGCTGAAATA CTAAAAGAAC TAGGCTACAC AGAAGAAGAA
ATAGCTAAAC TTGCCGAGAG GGGGGCAATA ATCCTTCAAA AACACTAG
 
Protein sequence
MHTYPLRGVR ILDLTAAMAG PFATMLLADL GADVIKIEPP EGDHARDWGP PSYGEKYSAY 
FASVNRGKKS IVLDLKKAEA REVFYRLVKT ASAVVENFRP GVAQKLGVDY HAVKQHNPNI
VYCSISGFGE GPYRDLPAYD LVALAMSGLM DLTGEPEGPP VKFAVPITDI AAGFYCALSI
ITSVLTNRPG YIEIPLIEAA ISLLTHQAGY YFASGVPPRR MGSAHPTIVP YQAFRAKDGY
FVLAVGSDHL WKKFCEAIGR PELADDPRFN TNTKRVQNRE ELVKMLEELF LEKEVNYWVS
LMWQNGIPAA PVYNLTQVFS DPHVRYRKIV VESQGPFGVI KTLKSPINAE SIKVGNYTPP
PLLGQHTAEI LKELGYTEEE IAKLAERGAI ILQKH