Gene Pars_0446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0446 
Symbol 
ID5055706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp389804 
End bp390982 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content58% 
IMG OID640468011 
Productextracellular ligand-binding receptor 
Protein accessionYP_001152696 
Protein GI145590694 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTATTT CTGGTAGATA CGCGGCTGAG GGACAGTACT CGCTGTGGGG GGCGCTTGCT 
GTGGTGAACT GGTTTAACGA CAACGGCGGC CTCGACTGCG GCGGGAAGAA GGTCAAGGTT
AAGCTTATCT ACAGAGACAG CGAGTCTAAG CTTGAGCTGG CCCAGAGCAT AACGGAGTCG
CTCATTACCC AGGACAAGGT GCACTTCCTC CTCAGCCCCT ACGGCTCAGA CTTGGCACTT
GGAGTCTCGC CAATCGCCGA GAAATACGGC GTGTTGATGG CCGTGGTCGG CGCGTCCTCG
GACCGCATAT TCCAGCAGGG CTTTAAATAC GTCATCGGCG TCGCCGCGCC CGCCAGCCAG
TACATGGTTC CGGTTCTGGA CATGGTTGTG AAAACCGACC CCACAGTCAA GAAAGTGGCC
ATCCTGTATA GAGACAGCGA GTTTAACAGA CAGGTGGCAG AGGGCGCCAA GGCATATGCC
GAGAAGCTCG GCTTGCAAGT AGTCGTCTAC GAGGTCTACC CATCCTCGCC CAAGGATCTG
ACGCCGCAAA TTCTGAAGGT CAAGCAAGCC GCGCCAGATG TGATCATCGT GGCTTCGCAC
TTCGCCGACG GCCAGCTGGC AGTGCAACAG TTAGCAGAGC AGAAGGTAGA CGCCAAGCTC
GTCGCCCTTT CGGTTGCCCC CCTAGTGCCG GACTTCTACA AAGCGCTAGG CGCCAAGGCC
GAGTGCATAG TGGGGCCGTC CCACTGGGAG CCCGGCGTCA AGTACACGCC CGACCTTGCA
AAGCAGAGGG GCGTCGAGTG GTTTGGCCCC ACTAAGGAGG AGTTCATCAC ATATTTCAAG
AAGGTTGCTA AGCAGATGGG CGGCCAGGAG GTGGACCCCG GCTACCACGC GGCGTGGGCG
GCCGAGGGTG TATTGACAAT CCTATACGGC GTACAGAAGG CTATATCAAC CAAGTCCGAC
AAGGTGCTCG GCGCTTTGCA GAACGCGAGG TTTATGACGT TCTTCGGCGA GTTTAAACTA
GACCCCACCA CTAACCTAAA CGTGGCCCAC TCAATGGTGG TGATCCAGTG GCAGGGAGGC
AACAGATATG TTGTTTGGCC CGCCGTAGTT GCTGAGGGCA AGCTCTTCTA CCCAAGCCTG
ACCTGGGACG AAAAGGCGGC TGGGAAGCTC TGTAAGTAA
 
Protein sequence
MPISGRYAAE GQYSLWGALA VVNWFNDNGG LDCGGKKVKV KLIYRDSESK LELAQSITES 
LITQDKVHFL LSPYGSDLAL GVSPIAEKYG VLMAVVGASS DRIFQQGFKY VIGVAAPASQ
YMVPVLDMVV KTDPTVKKVA ILYRDSEFNR QVAEGAKAYA EKLGLQVVVY EVYPSSPKDL
TPQILKVKQA APDVIIVASH FADGQLAVQQ LAEQKVDAKL VALSVAPLVP DFYKALGAKA
ECIVGPSHWE PGVKYTPDLA KQRGVEWFGP TKEEFITYFK KVAKQMGGQE VDPGYHAAWA
AEGVLTILYG VQKAISTKSD KVLGALQNAR FMTFFGEFKL DPTTNLNVAH SMVVIQWQGG
NRYVVWPAVV AEGKLFYPSL TWDEKAAGKL CK