Gene Information |
Locus tag | Pars_1070 |
Symbol | |
ID | 5055110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 954535 |
End bp | 955962 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640468626 |
Product | hypothetical protein |
Protein accession | YP_001153300 |
Protein GI | 145591298 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGCCC TGGCCTTAGT CATACTTCTG TCGGCTTTAG CGTTGTCCGC CACAATTTTG ACTAGGGCTG ACCAAGCCTC AATTGATGCA GCTCTGCCGG TACTCCAGAA GTATATCCCC GACGTGAAGA TCATCCCGGC GGATCCTACC CAATGGCGGA CTTTAATAAG CAGGGGGGAG GTTGACTTCT TGTGGGCCGG CGCGCCGGTG CTCTATGAAG CCCTTCTAAA AGAGGGCTAC CTATCGCCTA TGCCCGACGT GCCGGAGCTT GGACAGGTGC CGGATAAGAT AGGCAATATC CAACTGAAGA GGGTGGGGCC GGACGGGAAG ACGTATTATG TTGTATACGC GCTCCTGGGC TACGTAATCG GCTACAGCCC TGAGTCACTC CGCAGAGCCG GGGTAGCCCC TGTCTGTACC TGGCGCGACT TAGCCTCGCT CAACTTAACC AAGTACTACC TCCAGACGGG GAGCAAGCCT CTGGCCATTG CTAAGCCAAC AAAATCTACT TCAACTGCAG CGATGATGCA ACTGGTGACG GAGATCTACG GCTGGGAGGA GGGGTGGAAG TACTTGGCGG CGATGGCTGC GATGGCCAGG TTTGTGGACT CCAGCGGCGC GGTTAGGGAC GCTGTGAAGA GAGGAGAGGC GCTATTTGGT CCAATGGTCG ACTACTACGT CTTCCTAGCT GGACTAGGCT ACTGCATCCC CACGGACGGG ACTGACGTCT TGTTCGACCC CATTGCCGTG CCCCGCGGCG CCAACACAAC CGTGGCCACT ATTCTGACTA GGGTGTTCCT CACAGAAATG GAGAAGAGAC TTGTGGACAG GTACCTCCTC CCCGGCAACC CGGCCGTGCT GGACTCCCCA GACGTGAACC AGACTAAGGC TGCGTTGCTC AAGCAACATC TGCAAAAATT GATGAGCTCC AAGATCATGT ATGTGCCGCC GGAAAGGAGC GCCTCGTATT ACTACTCCTT CATCTACTAC TACGAGGCTA CGTTGGTAGA TCTCCAAGAC CTTCTCACAG ACGTCTGGAC TAAGGCGGCA AACGCCTACT TGACCGGTAA AATCTCCAAG CAACAGTTCG AGGAGCTGTG GAAGAAGCTG GGGGCGCCAG TCAATTATAC AGACCCCGAC ACCGGCAAGA CAGCTACCTT CACCTTCCAG ACAGCGGTTG AGATAAACAG CAAGGTGGGC AGCGACCCTG TGTACAGAGA CAAGGTGTAC CAGGCGTGGC GCGAAGCGGC TAGGGCGAAG TACGAGAAAG TAGCCCAGGA GCTACAGCGC TACGTTCAGG AAAACGCCGC CACGCCCCAA CAGACCTCTA CGCCCCCACC CGCTTCCCAG GGCTACACCC CCATAATTGT GGCCGCCTTG CTGTTGGCGG TAGTGGCAAT ATCGCTACTA CTGCTGAGGC GCAGATAA
|
Protein sequence | MRALALVILL SALALSATIL TRADQASIDA ALPVLQKYIP DVKIIPADPT QWRTLISRGE VDFLWAGAPV LYEALLKEGY LSPMPDVPEL GQVPDKIGNI QLKRVGPDGK TYYVVYALLG YVIGYSPESL RRAGVAPVCT WRDLASLNLT KYYLQTGSKP LAIAKPTKST STAAMMQLVT EIYGWEEGWK YLAAMAAMAR FVDSSGAVRD AVKRGEALFG PMVDYYVFLA GLGYCIPTDG TDVLFDPIAV PRGANTTVAT ILTRVFLTEM EKRLVDRYLL PGNPAVLDSP DVNQTKAALL KQHLQKLMSS KIMYVPPERS ASYYYSFIYY YEATLVDLQD LLTDVWTKAA NAYLTGKISK QQFEELWKKL GAPVNYTDPD TGKTATFTFQ TAVEINSKVG SDPVYRDKVY QAWREAARAK YEKVAQELQR YVQENAATPQ QTSTPPPASQ GYTPIIVAAL LLAVVAISLL LLRRR
|
| |
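The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either convention, although both are consistent with its numbers. The function names are hypothetical and are not part of any database tool.

```python
# Values copied from the record above.
START_BP = 954535
END_BP = 955962
GENE_LENGTH_BP = 1428
PROTEIN_LENGTH_AA = 475


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count of an unspliced CDS minus one, assuming a terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def strip_groups(seq: str) -> str:
    """Remove the whitespace that splits a displayed sequence into blocks of 10."""
    return "".join(seq.split())


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions the record is internally consistent: the coordinate span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length. `strip_groups` can be applied to the Gene sequence or Protein sequence fields before passing them to other tools.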