Gene Pars_2342 details


Gene Information

Locus tag: Pars_2342
Symbol:
ID: 5054894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 2094008
End bp: 2095441
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 55%
IMG OID: 640469894
Product: extracellular solute-binding protein
Protein accession: YP_001154538
Protein GI: 145592536
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
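
The gene and protein lengths listed above follow from the start and end coordinates. The Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the gene span includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
gene_len = gene_length_bp(2094008, 2095441)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1434 477
```

Both results agree with the Gene Length and Protein Length fields of the record.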


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.936887
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0125511
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTAT CTCAAGTAGT AGGGATTGTT GTCGCGTTGG TTGTTGTGGC GTTGTTGGGG 
TATTACCTCG GTTCTATGTC GCAACAGCCG CAAACTACGG CGCCGCCTTC ACAGACTACG
GCATCGCCTC AGCCGACCAC ACCGTCGGCC CCTAAGAAGA TTACGTTTTA CACCTGGTGG
GCTGGTCTTG AGAGGTTTGC CATCGATGCC GTTATTGGAA ACTTTACTAA AAAGACCGGC
ATCGCTGTAG AGAAGACGGC GGTGCCTGGC GGCGCCGGTG TGAATGCCAA GTTTGCAATC
TTGGCGCTGA TGCAGGCGGG GAATCCGCCG GCTGCATTCC AAGTACACTG CGGCCCCGAG
ATGTTGAGCT ATATCTACGT TGCACCGAAG GGCGAGGCCA GCTTTATGGA GCTGAGCAAC
GTGGCCAAGG AAATAGATAT GTCTACCCAA GCGCCTGACG TTCTGTCTGC GTGTTCCCTC
AACGGTAAGG TTTTCGCGCT TCCCGTCAAT ATCCACAGGG CTAACCTCAT CTTCATAAAC
AAACAAGTTC TCGACAAACT CGGCGGCTCT GTCCCCACAA CCCTAGACGA TTTGGTTGCT
CTCTGCAAGA AGGCCTCTGC CGCCGGGATG CCATGCCTAA TCCAAGCGGG CGCAGATCAG
TTCACAGTGC TACACCTCTG GGAGCAAGTT TTCCTTGCGG TGGCGGGGCC CCAGAAGTTT
ATAATGTTCA TGTACGGCAC TCTGTCACCC GATGACCCCT CGCTAAAACA AGCCACGGAG
AAGTTCTTAG AACTAGCTAA GTCATTCCCA GCTAACTGGC CAGCCCTAGA CTGGACAGGC
GCCGTCGCTG ACTTAGTTGC GGGCAAAGGG CTGGCTCACG TAGACGGGGA TTGGGTAGTG
GGCCTAATAT ACAACGTGTA TCCCCAGGTG AAGATGTGCC CCTACACCTC GATAACTCCA
GACTGTAACA TAGTTGTCGC GCCGTTCCCC GGGACCCAGG GAGTTTACAA TCTGGTGATA
GACTCAGTCG CGGTGCCGGC CGGCGGTCCG ACCACGCAAT TGGGCATAGA GTTTGTCAAG
TTCTTCGCGG GGCCAGAGGG GCAGTCTATC TTCAACCCAT TAAAGGGCTC AATAGCAGTG
TACAAGAACA TAGATCCCGC AATATACCCC ACCCCAATAC AGAGGTGGGA AGTACAGGAG
TATAGAGACG CCAGATCCTA CGTGTTCAGC CTTACCCACG GCGCGTTGTT CTCAGACGTC
TGGCAGATGT TGTTGCAACA AGCCATAGTC CTTGTGCAGA CGGGGAGGGC AGATCTGTGG
TACGACACCT TGTCAAAAGC GCTGTCCACA GAGCGCTCCC TGTGGAAGGA CACCTGGTAC
CTCGGCGCGC CGGGTAAGCC CTTCGGCGGC TACCAACCGC CGTGGGTTAA ATGA
 
Protein sequence
MKLSQVVGIV VALVVVALLG YYLGSMSQQP QTTAPPSQTT ASPQPTTPSA PKKITFYTWW 
AGLERFAIDA VIGNFTKKTG IAVEKTAVPG GAGVNAKFAI LALMQAGNPP AAFQVHCGPE
MLSYIYVAPK GEASFMELSN VAKEIDMSTQ APDVLSACSL NGKVFALPVN IHRANLIFIN
KQVLDKLGGS VPTTLDDLVA LCKKASAAGM PCLIQAGADQ FTVLHLWEQV FLAVAGPQKF
IMFMYGTLSP DDPSLKQATE KFLELAKSFP ANWPALDWTG AVADLVAGKG LAHVDGDWVV
GLIYNVYPQV KMCPYTSITP DCNIVVAPFP GTQGVYNLVI DSVAVPAGGP TTQLGIEFVK
FFAGPEGQSI FNPLKGSIAV YKNIDPAIYP TPIQRWEVQE YRDARSYVFS LTHGALFSDV
WQMLLQQAIV LVQTGRADLW YDTLSKALST ERSLWKDTWY LGAPGKPFGG YQPPWVK