Gene Pars_1732 details


Gene Information

Locus tag: Pars_1732
Symbol:
ID: 5054786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1559222
End bp: 1560187
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 52%
IMG OID: 640469275
Product: oligopeptide/dipeptide ABC transporter, ATPase subunit
Protein accession: YP_001153935
Protein GI: 145591933
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component
TIGRFAM ID: [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
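
The length fields above can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the record does not state either convention, but its numbers are consistent with both. The function name cds_lengths is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1559222, 1560187)
print(gene_len, protein_len)  # 966 321
```

Both values agree with the Gene Length and Protein Length fields of this record.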


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.561922
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.00101303
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCGTTGC TGGAGGTTAA AGAACTAAGG ACGTGGTTCC CTGTGAAAAA AGGTCTCTTC 
GGCCCTACTA GGTATGTTAA GGCGGTAGAC GGGGTGAGCT TCACGCTGGA GAGAGGAGAG
GTGCTGGCAG TGATAGGCGA GTCGGGATCC GGCAAGACCA CACTGGGGAG GACTGTGCTA
AGACTGATAA AGCCCACTGG CGGGAAGATA ATATTCGAGG AGAAGGACAT AACCAATACG
CCTGAGAGCC AGCTAAGGTG GTATAGGTTT TCCACTGCTA TGGTTTTCCA AGACCCCTTC
AGCTCGTTGA ACCCCTACCA CACAGTGCAG TACATTTTAG AAGAGCCGCT TATATTGAGG
GGGGTACCGC CGGAGGAAAG GCACGAGCTT GTAGTGAAGG CGCTGGAGGA GGTAAGGCTA
ACGCCGCCGG AGGACTTTCT CAAGAAATAT CCGCACATGC TTAGCGGAGG CCAGAGGCAG
CGTATTGGCA TTGCCAGGGC GTTGATCACA CGGCCTAAGT TCGTAGTGGC AGACGAGCCT
GTATCTATGC TGGATGTTTC AATCAGAGCT GAAATACTAT CCCTTATGAG GAGTCTGCAA
GAGAAGTACG GCATCACAAT GATATACATC ACACACGACA TTGCCACTGC CAAGTATTTG
TCAGACAAGA TCTTGGTAAT GTACGCCGGG AAGATGGTGG AATACGGGCC GTTTAGAGAT
GTCATAAAAG AGCCTCTACA TCCGTACACC CAAGCGCTGA TCGAGGCTCT GCCCGACCCT
GACCCTACAA ATAGGTTTAG AACTAGGAGG GTGCCGCCGG GCGAGCCGCC AAGTCTCATT
AATCCTCCGC CTGGCTGCCG CTTCCACCCC AGATGCCCCT ACGCCATAAA AGGCAAATGC
GAAAAAGAAG AACCGCCCTT TATTGAGGCG AAGAAAGGTC ACTACGTCGC TTGTTGGCTT
TATTAG
 
Protein sequence
MPLLEVKELR TWFPVKKGLF GPTRYVKAVD GVSFTLERGE VLAVIGESGS GKTTLGRTVL 
RLIKPTGGKI IFEEKDITNT PESQLRWYRF STAMVFQDPF SSLNPYHTVQ YILEEPLILR
GVPPEERHEL VVKALEEVRL TPPEDFLKKY PHMLSGGQRQ RIGIARALIT RPKFVVADEP
VSMLDVSIRA EILSLMRSLQ EKYGITMIYI THDIATAKYL SDKILVMYAG KMVEYGPFRD
VIKEPLHPYT QALIEALPDP DPTNRFRTRR VPPGEPPSLI NPPPGCRFHP RCPYAIKGKC
EKEEPPFIEA KKGHYVACWL Y
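
The protein sequence can be re-derived from the gene sequence, and the GC content recomputed, with a short script. This is a minimal sketch, not the database's own pipeline: the helper names clean, translate and gc_fraction are hypothetical. The codon table is the one NCBI lists for translation table 11, whose amino acid assignments are the same as the standard code; table 11 differs in its permitted alternative start codons, which this sketch ignores because the gene starts with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove the spaces and line breaks used in the record's 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGCCGTTGC TGGAGGTTAA AGAACTAAGG ACGTGGTTCC CTGTGAAAAA AGGTCTCTTC"
print(translate(first_line))  # MPLLEVKELRTWFPVKKGLF
```

The 20 residues printed match the start of the protein sequence above. Passing the full gene sequence to translate and gc_fraction would allow the same check against the complete protein and the reported GC content.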