Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1168 |
Symbol | |
ID | 5054211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1055509 |
End bp | 1057059 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640468718 |
Product | extracellular solute-binding protein |
Protein accession | YP_001153391 |
Protein GI | 145591389 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.731614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0353149 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAACAG GCTTGTTGGT TACACTAGTT GCAATTGTCG TCGTCGCGAT TCTCGCCGTG TATCTCGCCA CACAGCCGCC GACGCAGCCA CAACCCACCA CCTCCCCAAC TACTACCTCT ACTCCTCCCG CTACCACATC GCCGACGTCT CCGCCAACCT CTCCAACTAC TTTTCCGCCT ACTCCGTCGC CTTCCACCAC TCAGTCACCG TCTCCGACGT CTACTTCGAC GCCTCCGCCT ACTCAGCCGG CGCCTACTTG CGATAAGTTG GTGGTTTTGA CGAGGCACCC GACCGATATC TTGGACGCCA CGCGCGACTT GTTCTTGAAG AGCGACGTGG CGAAGAAGTA CGGCATTAAG GACGTGGTGT TTAGGCCTTT GGCCGCGGCG CAGTGGCGGC CGCTGATTGA GCGGGGCGAG GCTGATGCGG CGTGGGGCGG AGGGCCCACC CTCTTTGACT CCTTGTATAA AGACGGACTA CTCCTGCCTC TTGAAGGAGA TGAAGTAAAG GCCGCCATTG CCCAGATACC TAAAACCGTC GCGGGGATGC CTATGATGCG CGTGGGGCCG GACGGCAAAG TGTACTGGGT GGCTTGGGCA ATTTCCAGCT TCGGCATTAC GATCAACACT AAAGTGCTAA GGACAGCCGG CGTGCCTGAG CCCAAGACGT GGACAGACTT GGCGTCTCTA GAATACGGCA AAGCCATATT AAAAGGGATG CCAGTCACCG GCTTGGCGCA GTTGACAAAG TCGACGAGTA ATACCAGGAT TGCGGAGATT ATTCTCCAGG CATATGGCTG GGACCAGGGC TGGGTGGTCA TCACGCTGAC GGCGGCCAAC GGCAAGGTGT ATGGCGGAAG TGAGGCTGTA AGGGACGCGG TCATTGCCGG GGAGATCGGG GCTGGGTGGA CTATTGACTT CTACGGCTAC ACGGCGCAGT TGCAAAACCC CGACACGAAG TACGTAATTC CGCCGGATAC GTCGGTTAAT GGTGACCCCA TTGCCGTGGT TAAGAACACC AAGTGCAGAG CTGCCGCTGA GGCCTTCGTG GCGTGGGTGA TTACAGAGGG CCAGGTGGTG GTGTTCGACC CCAAGATTAA CAGAATGCCC GTCAACCCCA ACGCCTTCAA CACGCCTCAG GGGAAGCAGA GGCCCGACCT AAAGAGCGTA TATGACCAGC TCTTCCAGCT TAAGACCATC GAGTTCAACG ACACCCTAGC GCTTGCCGTG GAGAACGTTG TTATGTACTA CTTTGACGCG GCGATCACCG ACAACATAGA CATCTTGCAA CAGACGTGGC TTAAGCTGGT AAAGGCTCTA AACGACGGGA AGATTGACAG AACGAAGGCG GAGGCCTTGG CGCAGAGACT CGGCGAGCCG GTGACCTTTG TAGACCCAGA CACGGGGCAG TCTGTCAAGC TGACGATGGA ATACGCCATG AGGATAAACG ACAGGATTGG AACAGACTCG ACGTATAGAG ATAAGGTCTA TGCCGCATGG AGAGACGCCG CGAGGAAGAA GTACCAGGAG GTAGCCTCCC AAATCCCCTA G
|
Protein sequence | MRTGLLVTLV AIVVVAILAV YLATQPPTQP QPTTSPTTTS TPPATTSPTS PPTSPTTFPP TPSPSTTQSP SPTSTSTPPP TQPAPTCDKL VVLTRHPTDI LDATRDLFLK SDVAKKYGIK DVVFRPLAAA QWRPLIERGE ADAAWGGGPT LFDSLYKDGL LLPLEGDEVK AAIAQIPKTV AGMPMMRVGP DGKVYWVAWA ISSFGITINT KVLRTAGVPE PKTWTDLASL EYGKAILKGM PVTGLAQLTK STSNTRIAEI ILQAYGWDQG WVVITLTAAN GKVYGGSEAV RDAVIAGEIG AGWTIDFYGY TAQLQNPDTK YVIPPDTSVN GDPIAVVKNT KCRAAAEAFV AWVITEGQVV VFDPKINRMP VNPNAFNTPQ GKQRPDLKSV YDQLFQLKTI EFNDTLALAV ENVVMYYFDA AITDNIDILQ QTWLKLVKAL NDGKIDRTKA EALAQRLGEP VTFVDPDTGQ SVKLTMEYAM RINDRIGTDS TYRDKVYAAW RDAARKKYQE VASQIP
|
| |