Gene Pars_1701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1701 
Symbol 
ID5054484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1535298 
End bp1536605 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content57% 
IMG OID640469244 
Productextracellular ligand-binding receptor 
Protein accessionYP_001153904 
Protein GI145591902 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.58174 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATA CATTGTATAT TGTAGGGGCT GTAGTAGTGC TCATTGCGAT TTTGGGCATA 
CTCCTCACCA TGGGCGGCGG CCAACAACAG ACAACAACCG CGCCGCCCTC CAGCACGCAG
ACGACCCAAC CCACCACCAC AACTAAGGTC TTTAAGCTGG GCGCCCTTCT GCCGCTGACA
GGCGGCTTTA GTAGCTACGC CAAGTTGGCC CAATGCGCCG CGGAACTCGC GGTAGATGAA
CTCAACTCCG AATATGCGAG CAAGGGATAC AAATTCGAGC TGTACGTCGA AGACACCCAG
CTTGACCCCA ACGTGGCGGC CCAGAAGCTA CAAAGCCTCT ACGCCAGAGG CGTGAGGGCG
GTCCACGCAG GCCTCACCAG CAGAGAGGCT TCCGGAGAGA AGCCATTCGC CGACCAGAAC
CACATAATCC TCTTCAGCGC GTGGTCTACG TCGTCGCTTC TAGCCTTGCC TAACGACTGG
CTGTACAGAA TCGTCGGCAC CGACGCCAAA CAGATCAGAG CCATAGGCGC CGTCCTGAAG
GAGCTCGGAG TAAAGAAGGT AGCTCTTGTA TACAGGAAAG ACGCCTACGG CGAGGGCCTA
TACCTGGAGC TCCAGAAAGA GGCAGAGAAG CGCGGCTTCA AGCTCGTCTC CGTCGCCGCC
TACGACCCCG ACCCAAAGGC CTTTCCACAA ACAGCTCCCG AGGCTGTGAA AAAAATATCC
TCAGAGGTTA AAGACTTAGT AGGCCCCGAC TTCGCCCTGG TCATCGTATC CTTTGAAGAC
GACGGCTCAG TGGTGCTAAA CGCAATAGGA CAAGACCCCG TACTCTCTAA GGCCAGGCTT
ATAGGCACTG AGGGCATGGC GTATTCGCCC ATACTGCTAC AAGAAGGCGG CAACGTCATG
GCAAATGGGA AGATCATAGG AACCGCCAAC TGGGCTCTGG CCACCACACC GGAGTATCAG
CAGTTCGCCC AGAAGTTCCG CGCTAAGTGC GGCGCCGAGC CCATAACCCC CGCCCCCCAG
TCCTACGACA TTATAAAAAT GCTTGGCGAA ATCATGGCCA CGATAGGGAC TGACGACCCC
GACAAGGTAC GCGCCACGCT TGAGCAGTGG GGCAAAGAAG GCCGGTACAA GGGAGCAACC
GGCACGGTGC TCCTCGACGA AAACGGCGAC AGGGCCAACC CCAGCTTCAT CCTCTGGGGC
GTCACAGTAA AGAACGGCAA GCCGCAGTAC ATCGACATCG GCTTCTACAA CTACGACAAA
GACACCATCG AGTTCACCGA GGAGGGCAAA CAGTACTTCT ACGGGTAA
 
Protein sequence
MKNTLYIVGA VVVLIAILGI LLTMGGGQQQ TTTAPPSSTQ TTQPTTTTKV FKLGALLPLT 
GGFSSYAKLA QCAAELAVDE LNSEYASKGY KFELYVEDTQ LDPNVAAQKL QSLYARGVRA
VHAGLTSREA SGEKPFADQN HIILFSAWST SSLLALPNDW LYRIVGTDAK QIRAIGAVLK
ELGVKKVALV YRKDAYGEGL YLELQKEAEK RGFKLVSVAA YDPDPKAFPQ TAPEAVKKIS
SEVKDLVGPD FALVIVSFED DGSVVLNAIG QDPVLSKARL IGTEGMAYSP ILLQEGGNVM
ANGKIIGTAN WALATTPEYQ QFAQKFRAKC GAEPITPAPQ SYDIIKMLGE IMATIGTDDP
DKVRATLEQW GKEGRYKGAT GTVLLDENGD RANPSFILWG VTVKNGKPQY IDIGFYNYDK
DTIEFTEEGK QYFYG