Gene Pars_1591 details


Gene Information

Locus tag: Pars_1591
Symbol:
ID: 5054901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1440881
End bp: 1442317
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 55%
IMG OID: 640469132
Product: ABC transporter related
Protein accession: YP_001153797
Protein GI: 145591795
COG category: [R] General function prediction only
COG ID: [COG3845] ABC-type uncharacterized transport systems, ATPase components
TIGRFAM ID:
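
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that the listed gene length is consistent with) and that the last codon of the CDS is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1440881
end_bp = 1442317
gene_length_bp = 1437
protein_length_aa = 478

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is assumed to be the stop.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span)                     # 1437
print(remainder)                # 0
print(expected_protein_length)  # 478
```

Under these assumptions the span matches the listed gene length, and 479 codons minus one stop codon gives the listed 478 aa.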


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.243746
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGTTT CTCTCAAGGA AATTCACAAA ATTTTTTCAG ACGGAACCCA CGCCTTGCGT 
GGGGTTTCTC TTGATATCTA TCCAGGGGAG GTTTTGGCAT TGCTAGGCGA GAACGGCGCA
GGAAAGACAA CGCTGATGAA GATCCTTGCG GGCATCTACA AGCCCACATC TGGGGAGATA
TACATCGACG GGAAAAAGGC TAGGTTCAAA AACGCCCGGG AGGCTTTGCG CCTCGGCATA
GCCATGGTGC ACCAACACCT TTCCCTCATA CCAGGGCTTA CCGCACTTGA GAATATCGCC
GTGCTGGAGG GGGCGGGCCT AGGGCCTATA TCAGGAGAGG TGAGGAAGAG GGCAGAGGCC
ATCGCGGCGG GGCTGGGTTT TGAAATTGAC TGGGATAGAG ACGTGGAGGA GCTCCCCCTA
GGCGTGAGGC AGAGAGTGGA AATCGTGAAG GCGCTTTATT GGGGCGCCGA CTTGCTAATC
CTCGACGAGC CCACCACAGT CTTGTCCCCT CCCGAGGTGA AGTCCCTTTT CCAAGTAGTT
AAGAGCCTCA AACAAAAGGG GAAGTCCATT GTATATATCA CACATAAAAT ACCAGAAGTA
CTCGAGGTGG CTGATAGGGT CGCCGTGCTG AGACGTGGGG TAAAAGTCGC CGAGTTCAAG
CCACCGTACG ACGCCAAGAA GCTGGTGGAG GCTATGGTAG GCGAGCTTAA AACAGAGAGC
GTAGAGAGGT CGGGAGAGAC CGGCGAGAGG CCGGTGCTAG AGGTGGTGGA TCTCTGGGTC
TACGAAGGGG GGAGAGCCGT GGTCCAGGGC GTTAACCTGG TCGTGAGGGA GAGCGAAATT
TTAGCCGTAG TGGGGGTAGA GGGTAACGGA CAGGAGCACT TGGTGGAAGC CGTGGTAGGG
CTGAGGAAGT ACAAGGGTGT TGTGAAAATC CATGGGGGCT ACGCATATAT ACCTGACGAC
AGGCATAGAA AGGCCCTAGT CTTGGAAAAG ACGCTTGTGG AGAACGCGAT TTTGGGGAAG
GAGGCCGAGT TCTCTAGACG CGGCCTCATC TCTTGGAAAG ACGCAGAGAG ATTTACGGCA
AAACTAGTGG AGGAGTTCGG AATCGTGACT CCTGGGCCGT GGGCTTTCGT GAAGCAACTT
TCAGGCGGCA ACCAGCAGAA GCTTGTAGTG GGCAGGGAGC TGAGCAGAAA CGCCAAGCTT
ATAGTTGCAC ATCAGCCCAC GAGGGGGCTC GACGTGGCGA CAACGGAGTA TGTACAACAT
TTGTTGTTAA AGGCGAGGAA CAACGGCGCG GGGGTGTTGC TCGTCACAAG TGACCTAGAC
GAGGCATATA AGCTGGCCGA CACAATCGCC GTGATGTATC GCGGTAGGAT AGTCGCCATA
GGGTCTGTGG GAGAGATGGC TCTTGACGTA GTAGGGAAGA AGATGGCAGG GCTATGA
 
Protein sequence
MQVSLKEIHK IFSDGTHALR GVSLDIYPGE VLALLGENGA GKTTLMKILA GIYKPTSGEI 
YIDGKKARFK NAREALRLGI AMVHQHLSLI PGLTALENIA VLEGAGLGPI SGEVRKRAEA
IAAGLGFEID WDRDVEELPL GVRQRVEIVK ALYWGADLLI LDEPTTVLSP PEVKSLFQVV
KSLKQKGKSI VYITHKIPEV LEVADRVAVL RRGVKVAEFK PPYDAKKLVE AMVGELKTES
VERSGETGER PVLEVVDLWV YEGGRAVVQG VNLVVRESEI LAVVGVEGNG QEHLVEAVVG
LRKYKGVVKI HGGYAYIPDD RHRKALVLEK TLVENAILGK EAEFSRRGLI SWKDAERFTA
KLVEEFGIVT PGPWAFVKQL SGGNQQKLVV GRELSRNAKL IVAHQPTRGL DVATTEYVQH
LLLKARNNGA GVLLVTSDLD EAYKLADTIA VMYRGRIVAI GSVGEMALDV VGKKMAGL
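
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first three 10-base blocks and the final seven codons of the gene are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in 10-base blocks.
first_blocks = "ATGCAAGTTT CTCTCAAGGA AATTCACAAA"
# Last 21 bases of the gene sequence, regrouped here into codons.
last_codons = "AAG AAG ATG GCA GGG CTA TGA"

print(translate(first_blocks))  # MQVSLKEIHK
print(translate(last_codons))   # KKMAGL
```

The two outputs correspond to the first ten and last six residues of the protein sequence listed above.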