Gene Pars_1865 details

Gene Information

Locus tag: Pars_1865
Symbol:
ID: 5055882
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1667542
End bp: 1668906
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 53%
IMG OID: 640469411
Product: extracellular solute-binding protein
Protein accession: YP_001154068
Protein GI: 145592066
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2182] Maltose-binding periplasmic proteins/domains
TIGRFAM ID:
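
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the record. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1667542
end_bp = 1668906
protein_length_aa = 454

# Assumption: coordinates are inclusive at both ends, so add 1 to the difference.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so N codons encode N - 1 residues.
codons = gene_length_bp // 3

print(gene_length_bp)                   # 1365
print(codons - 1 == protein_length_aa)  # True
```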


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAAA ACCTTCTTAT AGGCACAGTG GTAGTCCTAT TAGCCATTCT TGCCGCTGTC 
GGTTATATAA TGTCACAGTC ACAACCTGCA CAAACTCCCA CGACCCAATC ACCGCCGCCA
GCTCCCTCAA CGCCGACGAC CACACCGAGC CCCACACAAA CCCCCACGGC TCCCCCCACA
CCCACCTCAC AACCAACGAC AACTACGCCG ACTCCCCCCC CACAAACTCC CAGCCCCACG
CCGAGCCCAA GCCCAACGCA ACCGCCTGCC CAGAAAGTGA TAATTAAGAT CTGGCACGCC
CTAAACCCAG AAGAGGAGGC CGTGTTCAAG GAGATCGCCT CGCTGTATAT GAAATCAAAC
CCAAACGTCC AGATAGTGTT TGAAAACAAG GCCCCCGACC TACAAACCGC CGTATTGGCG
GCAGTGGCTA CAGGTGAGAA ATTCGACTTG TTCATATGGG CCCACGACTG GGTTGGGCTA
ATGGTAGAGG CAGGGGTGTT AAAGCCTGTA GATAATGATG TGGCCGACGT GTTGCCTAAA
TTCGCAGTGC CGATCCCGCA GTACCAAGGA CACGTATACG GTTTGCCCTT TACGGCTGAG
ACAGTGGCGC TGATCTGCAA CAGGAAAATG GTGCCGGGGC CACCCAAGAC TTTTGATGAG
CTACTCGGTA TTATGAAGAG TTACAACAAT CCGCCGAAGA CATACGGCAT AGCTTATGTG
GTGAACCCAT ACTTCATATC AGCGTGGATC CACGGAGCCG GCGGCTACTA CTTTGACGAC
AAGACAGAGA AGCAGGGACT AAACAACCCC AAGTCTATAG CGGGGTTTGA GTTCTTTAAG
AAGTACGTAA TGCCATATGT TGGGCCCAAC CCCACGGACT ACAACACGCA AGTTAGCCTA
TTCCTAGGCG GACAAGCCCC CTGCATGGTA AACGGGCCGT GGAGCATAGG CGCTGTGAAG
AAAGCCGGCA TAGACTTCTT CGTGGCGCCA CTCCCGCCGG TGAACAACAC ATTTGTGCCC
AGGCCGTACG GCGGATTGAA GATGTTCTAC GTCACGATCT ACGCCTCGAA AGAGGCTATC
GACTTCATGA AGTGGTTCAC CACAGATCCC CAGGTGGCTA AAATCTTGGT GGACAAGCTG
GGCTACGTAC CTGTTATAAA GGACGTCCAA ATTCAAGACC CAGTGGTACA AGGCTTCTAC
GAAGCTATTA AGAACGTCTA CTTAATGCCC GTGTCTCCGA AAATGCAACC GGTGTGGGGC
GCCGTCGATT TAGTAATACA AAACGCCATA GTGTCCGACC AAAAGACCAT CCCTCAGGCG
ATCAACGACG CAGTTAAGGA TCTCTGCGCC AGAGGCCTCT GTTAA
 
Protein sequence
MNKNLLIGTV VVLLAILAAV GYIMSQSQPA QTPTTQSPPP APSTPTTTPS PTQTPTAPPT 
PTSQPTTTTP TPPPQTPSPT PSPSPTQPPA QKVIIKIWHA LNPEEEAVFK EIASLYMKSN
PNVQIVFENK APDLQTAVLA AVATGEKFDL FIWAHDWVGL MVEAGVLKPV DNDVADVLPK
FAVPIPQYQG HVYGLPFTAE TVALICNRKM VPGPPKTFDE LLGIMKSYNN PPKTYGIAYV
VNPYFISAWI HGAGGYYFDD KTEKQGLNNP KSIAGFEFFK KYVMPYVGPN PTDYNTQVSL
FLGGQAPCMV NGPWSIGAVK KAGIDFFVAP LPPVNNTFVP RPYGGLKMFY VTIYASKEAI
DFMKWFTTDP QVAKILVDKL GYVPVIKDVQ IQDPVVQGFY EAIKNVYLMP VSPKMQPVWG
AVDLVIQNAI VSDQKTIPQA INDAVKDLCA RGLC
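
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a minimal sketch (not part of the record), the helper below builds the standard codon-to-amino-acid mapping, which translation table 11 shares with the standard code for codons after the start position, and translates the first and last blocks of the gene sequence. The function name `translate` and the sample variables are illustrative. The sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon assignments in NCBI order (first, second, third base each cycling T, C, A, G).
# Assumption: table 11 uses the same assignments as the standard code
# for codons after the start position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 45 bases of the gene sequence, as printed in the record.
first_block = "ATGAACAAAA ACCTTCTTAT AGGCACAGTG"
last_block = "ATCAACGACG CAGTTAAGGA TCTCTGCGCC AGAGGCCTCT GTTAA"

print(translate(first_block))  # MNKNLLIGTV
print(translate(last_block))   # INDAVKDLCARGLC
```

Both outputs match the start and end of the protein sequence listed above.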