Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0961 |
Symbol | |
ID | 5055929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 852429 |
End bp | 853391 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640468517 |
Product | oligopeptide/dipeptide ABC transporter, ATPase subunit |
Protein accession | YP_001153193 |
Protein GI | 145591191 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4608] ABC-type oligopeptide transport system, ATPase component |
TIGRFAM ID | [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.233776 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.000990289 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGGCCC TAGTAAAGGC CGAGAGACTA AGCAAGTACT TCCCAGTAGG CGGGTTGCTG AGACCGCGCG GCTACGTCAA GGCCGTCGAC AACGTTGACT TGGAGATCTA CGAGGGGGAG ACCCTTGGGC TGGTTGGCGA GACGGGGAGT GGGAAAACCA CTTTAGGGAG ACTCATACTT AGACTAATAG AGCCCACCTC GGGCAAGATA TACTTCGACG GCGCCGACGT CACGAAACTT TCAGGTAAGG AGCTGGCCAC GTTTAGGAGA AAGGCTCAGA TAATATTCCA AGATCCCTAC ATGTCGCTTA ACCCGCGCTT TACTGTTTAC CAAACACTCC TCGAAGTAAT TAAAGTTCAC AAACTGCCTA TACAAGACCC AGAAGAGCAC ATAGGCAAAA TGCTGGAGCT TGTGGGGCTC GAGAGGAGCC ACCTCCACCG CTACCCGCAC GAGTTCAGCG GAGGCCAGAG ACAGAGAATT GCGATACTCA GGGCACTGAT ACTTGAGCCC AAGTTCCTAG TACTCGACGA GCCGACCTCG GCGCTCGACG TGTCGGTACA GGCCCAGATT TTAAACATGT TGAAGGACCT TCAGAGGCGC CTCGGCCTTA CATACTTATT CATTAGCCAC GATATAGGAG CCGTCCGATA CATGAGCAAT AGAATTGCAG TAATGTATAT GGGCAAAATC GTGGAAATAG GCCCCGTAGA CGCCGTAATT AAGGAGCCTT TACACCCATA CACCCAAGCC CTCATTTCGG CGCTTCCAGT GCCAGACCCC AAGATAGCTA GGAGCAAAAA GACGGTGCTC TTGCAAGGAG AACCGCCAAG CCCAATAAAT CCGCCTGCCG GTTGCCGCTT CCACCCCCGC TGCCCCTACT TCATAAAAGG AAAATGCGAC GTGGAAGAGC CCCAGTTGAA AGAGGTAAAA AGTGGCCACT ACGTCGCGTG TCACCTATAC TAA
|
Protein sequence | MKALVKAERL SKYFPVGGLL RPRGYVKAVD NVDLEIYEGE TLGLVGETGS GKTTLGRLIL RLIEPTSGKI YFDGADVTKL SGKELATFRR KAQIIFQDPY MSLNPRFTVY QTLLEVIKVH KLPIQDPEEH IGKMLELVGL ERSHLHRYPH EFSGGQRQRI AILRALILEP KFLVLDEPTS ALDVSVQAQI LNMLKDLQRR LGLTYLFISH DIGAVRYMSN RIAVMYMGKI VEIGPVDAVI KEPLHPYTQA LISALPVPDP KIARSKKTVL LQGEPPSPIN PPAGCRFHPR CPYFIKGKCD VEEPQLKEVK SGHYVACHLY
|
| |