Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1895 |
Symbol | |
ID | 5055528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1700747 |
End bp | 1702162 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640469444 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001154098 |
Protein GI | 145592096 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAAGAG AATTTCTAAA AAGCCCCTCG GGTCTGGCAG GTACGATAAT TTTAGTCACT TTTTCTCTGA TATCGGTATT TGTTGTTATG ACATTTCCGA TGGATTACGG TACCAAGTAC TGGTCAAATC CAAAATACTG GGAGGACTAC CCTAAGCTGG TGCCTCCGGT GTGGTACAAC TACTTTGTAC CGTATAAACT GCCACAGCAC TTTGTAAAAG ATCTAACTAG CCCCTCAGAG ATGGTGGATA ATGTAAAAAA GTGGGTTGTT GAGTATAAAT TTGAGGCTGA TAAGTTCCCC ACAGGGATTA TTTTCAAGTA CATCAACTTG ACGTTCTATG GCGACGTGCC GGTGATTTCG CTGAAAGTAA AAAGGCCAGA TGGCAAGGAG GTGGTTTTGT TAGACTACGA CGGGGGTATA CCACCGCCAA GGTCGGGCGA GAAGGCGCCG TACGTGCGGT TTGTGGAAAA CCCCAGAACC ATAATACTCA TCTCAGATCC TCAAGCAGTT AGAAAAGTTA CAGTATTTGC AGAAGAGCTG GGTGTAAACT GTACTACGGC AGATGTTAGA GACGCCGGGC TTCTCCCCTA CATAGTTTTT GGCACGCCTA TCTCAAGTAA GTGCGTCGCC GAGTCTTTCA AACCACTGAA AGGGAGCTAC GTATTCGAGG TGGAGATGGT GGGTGATCCT AAAGATGAGA TAGGGCTACT CCGCCTAGTA GTCCAAGGCG CTGTCTACGG CGCCATGGGC ACAGACTATT TAGGACGCGA CTTGGCACAA GGCCTCCTAT TCGGCTTTCC AGTCGCCTTA TTTATAGGTG GTGTAGTCGC GCTCCTCGCT ACGCTTATAG GATTAGCCCT CGGTATTATA AGCGGGTACA TAGGCGGCAA GGTCGACGAG GCAATACAGA GGTTCGCAGA CGTTTTGAAT AACCTCCCCT TGTTGCCTCT TTTAATACTG TTTGTCTTCG TGCTCGGCCG TAGCCTCTTT AACATTATAT TAGTGTTGGT CGTGTTCGGT TGGGCCGGTA CTACCATCAT TGTCAGGTCC ATGGTCCTCA GCATAAAGAC GAGCCAGTTC GTGGAGGCGG CTAAACTGGC GGGGGCAAGC CACTGGTGGA TAATGAGGAA GCACATACTG CCGCCTGTCT TGCCCTACGC CTTTGCCCTT ATGATCTTTG CAATACCCGG GGCGATACTG TCTGAGGCAA GTCTCAGCTT CTTGGGACTC GGCGATCCCA GCATCCCGAC ATGGGGGCAA ATTCTACAAC AAGCGTTTGA CAACGGCGCG TTGCAAAACT TTGCGTGGTG GTGGATACTA ACGCCCGGCT TCTTGATCGT TATCACAGCC ATAGGCTTTG TCCTGGTATT CTTCGCGCTG GAGCCTATTG TCAATCCGAG GTTGAAGAGG CAGTAA
|
Protein sequence | MIREFLKSPS GLAGTIILVT FSLISVFVVM TFPMDYGTKY WSNPKYWEDY PKLVPPVWYN YFVPYKLPQH FVKDLTSPSE MVDNVKKWVV EYKFEADKFP TGIIFKYINL TFYGDVPVIS LKVKRPDGKE VVLLDYDGGI PPPRSGEKAP YVRFVENPRT IILISDPQAV RKVTVFAEEL GVNCTTADVR DAGLLPYIVF GTPISSKCVA ESFKPLKGSY VFEVEMVGDP KDEIGLLRLV VQGAVYGAMG TDYLGRDLAQ GLLFGFPVAL FIGGVVALLA TLIGLALGII SGYIGGKVDE AIQRFADVLN NLPLLPLLIL FVFVLGRSLF NIILVLVVFG WAGTTIIVRS MVLSIKTSQF VEAAKLAGAS HWWIMRKHIL PPVLPYAFAL MIFAIPGAIL SEASLSFLGL GDPSIPTWGQ ILQQAFDNGA LQNFAWWWIL TPGFLIVITA IGFVLVFFAL EPIVNPRLKR Q
|
| |