Gene Information |
Locus tag | Pars_1896 |
Symbol | |
ID | 5055511 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1702159 |
End bp | 1703229 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640469445 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001154099 |
Protein GI | 145592097 |
COG category | [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
|
|
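The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length includes the stop codon; both are assumptions inferred from the numbers in this record, not statements from the database. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table for Pars_1896.
length = gene_length(1702159, 1703229)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Both results match the Gene Length (1071 bp) and Protein Length (356 aa) fields.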
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.881035 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCTC TAGCCAAGAC TCTCCTTGTT AAGGCCATAA CCCTGGTCGT GGTTTTGATA GGCGTGTTAG TGCTTCTAGC AGTAATAATG GGAGCCACCG GCCTCTCTGA TAAAATGCTA AACTCGATCC TCACGTCGGA GGTGCAGGAG TATAAACAAC AACTTTTAAG ACAGGGCAAA GACATAGCCG CAATAGAAAA GGCTGTTGAG GAGTTTAAAA AAGAAAGAGC GGCGGCGCTG GGAATAGACA GGCCTTGGTA CGAGAGGATG CCCCAGTTGA TCTACCGCCT CCTCGTCTTG GATCTCGGCA ATTCAAGAAC GTTGCAGTCG TCGTGGGGGT CTAACAGAAT AGCCGATCTT ATACTAGATC GACTGCCCAA TACCATAATT TTGACAACTA CCGGCATTTT ACTCACAGCG CTTGTAGGCA TATGGATGGG GCTGTACATG GGGGCTAATA TTGGGAGTAG AGCCGATAGG GTCGTGTCAA TTTTGTCGGC GGCGTCTTAT GCCTTGCCGC TGTGGTTTGT TGGTCTTGTC CTGATACTAG TGCTTGCCTA CGGCCCCAAG ATACTGTGGG GCGTCCAGAT ATTTCCGCCG GGAGGCATGG TATCTACGCC TCCGCCGAAG GAGCCTCTGG CCTATGTATT AGACGTAATG TGGCACCTTT CCCTACCTCT GCTTGCCTCC TTTATTGTCT TTTTTGGGAG CTGGGCCTAC ACTACTAGGA ACATTGTCTT CAGCGTATCA CAGGAAGACT TTGTTAACTT TGCCAGGGCA AAGGGACTGC CGGAAGATAT GGTAAGAAAC AGGTACATAC TTCGTCCCTC ACTTCCACCC ATCTTGACTA ACCTAATTCT GAGCCTCGCG GCCTCGATCT CGGGGTATAT CATTACGGAG AGGGTTTTCA ACTGGCCGGG CATGGGGTCG CTGTTCTACG CCGCAATAAC GGCACTCGAC GAGCCGGTAA TCTTCGCATT GACATACGTC TTTACGCTTG TGTATATAAT AGGGAGGTTT ATACTAGAAA TACTCTACGT CTTACTCGAC CCCCGAATTA GGTTATCATG A
|
Protein sequence | MASLAKTLLV KAITLVVVLI GVLVLLAVIM GATGLSDKML NSILTSEVQE YKQQLLRQGK DIAAIEKAVE EFKKERAAAL GIDRPWYERM PQLIYRLLVL DLGNSRTLQS SWGSNRIADL ILDRLPNTII LTTTGILLTA LVGIWMGLYM GANIGSRADR VVSILSAASY ALPLWFVGLV LILVLAYGPK ILWGVQIFPP GGMVSTPPPK EPLAYVLDVM WHLSLPLLAS FIVFFGSWAY TTRNIVFSVS QEDFVNFARA KGLPEDMVRN RYILRPSLPP ILTNLILSLA ASISGYIITE RVFNWPGMGS LFYAAITALD EPVIFALTYV FTLVYIIGRF ILEILYVLLD PRIRLS
|
| |
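The sequences above are displayed in space-separated blocks of 10 characters. As a rough sketch of how the gene sequence relates to the protein sequence, the blocks can be joined and translated codon by codon. The codon assignments below are those shared by the standard code and translation table 11; the table 11 rule that alternative start codons are read as methionine is not modelled here, which does not matter for this record because the gene starts with ATG. The helper names `join_blocks` and `translate` are hypothetical, and only a short excerpt of the gene sequence is used.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(spaced: str) -> str:
    """Remove the display spacing from a blocked sequence and upper-case it."""
    return "".join(spaced.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    residues = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
excerpt = "ATGGCTTCTC TAGCCAAGAC TCTCCTTGTT"
print(translate(join_blocks(excerpt)))  # MASLAKTLLV
```

The output matches the first block of the protein sequence. Note that although the gene lies on the minus strand, the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so no reverse complement is applied in this sketch.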