Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0964 |
Symbol | |
ID | 5055848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 855416 |
End bp | 856414 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640468520 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001153196 |
Protein GI | 145591194 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0000198009 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCTCTT TCAGGAGGTT TCTACTTAGG CGTTTTTTGA CCTTCCTCCC GACAATTTTC GGCGTTGTTT TTATCACCTA TCTCATAGCT TACGCCATAC CTGCAGATCC CGCTAGGGCT TGGGCTGGCG GCGAGAAGGC AAAGCCTGAG GTAATTGAGC GGCTTAAGCA GTACTACCAC TTCGATAAGC CGTGGTACGA ACAGTTTTAC TACTTCCTCG TAAGGCTTTT CGAGGGGTCT TTAATAAGCC CACGAACCGG CAACACCGTA TTTGCAGATA TTGCGGCTAG GTTTTCTGTA ACGCTACAAC TGGCGCTTTT TTCGATATTT TTCTCAGTAG CGATTGGCTT GCCGTTGGGG CTTCTTGCCG CCTATAAGAG GGATACCAAG ATAGACACCG CGGTCAGAAT ACTAGCGCTA ATCGGCGTCT CTATGCCGGC GTTTCTTCTC GGCTATCTCC TAATTCTCGT GTTCTTTGTC CAATTTAAGG CCATTACCCT AGCCGGCGTC CCCACGGCAA AAGTCTCGAT AACTGGAATC CCGCTCATAG ACGCCTTGAT AACGCTGGAT TTCGACTCTC TTTCCCAAAT CGTCGGTCGG TATTGGTTGC CGGGATTCGT ACTCGGCTTC TCCGGGGCTG GGATAATAGC TAGGTTTGTG AGAAACTCCA CAGTAGAGGC GCTAGGCGCG GATTTTGTAG AGTATCTACA TGCAAAGGGG TTGTCCCCAG ACTGGGTGAG GAGGCATGTT TTCAAAAACT CGCTTGTGCC CATCGTCACA ATAATTGGCC TCGAATTCGG CGCGTCTCTC AGCGGCGCCC CTATTACAGA GACTATTTTT GGGCTACCCG GCCTGGGGGC CTACGCAGTG CAGTCTATCT ACTACCTGGA CTTCCCCGCT ATTATCGGCA CAACTTTTGT CTTCGCCATT ATCTACGTAG TTACTAATTT CGTAGTCGAT CTATTCTACG CATTTATAGA CCCAAGGGTG AGGTACTGA
|
Protein sequence | MSSFRRFLLR RFLTFLPTIF GVVFITYLIA YAIPADPARA WAGGEKAKPE VIERLKQYYH FDKPWYEQFY YFLVRLFEGS LISPRTGNTV FADIAARFSV TLQLALFSIF FSVAIGLPLG LLAAYKRDTK IDTAVRILAL IGVSMPAFLL GYLLILVFFV QFKAITLAGV PTAKVSITGI PLIDALITLD FDSLSQIVGR YWLPGFVLGF SGAGIIARFV RNSTVEALGA DFVEYLHAKG LSPDWVRRHV FKNSLVPIVT IIGLEFGASL SGAPITETIF GLPGLGAYAV QSIYYLDFPA IIGTTFVFAI IYVVTNFVVD LFYAFIDPRV RY
|
| |