Gene Pars_1283 details

Gene Information

Locus tag: Pars_1283
Symbol:
ID: 5055211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1161073
End bp: 1162302
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 56%
IMG OID: 640468830
Product: periplasmic binding protein
Protein accession: YP_001153499
Protein GI: 145591497
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component
TIGRFAM ID:
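
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic in Python, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon. Both are assumptions about this record's conventions, although they do reproduce the listed values:

```python
# Values copied from the record above.
start_bp = 1161073
end_bp = 1162302

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1230
print(protein_length)  # 409
```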


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.907
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.591849
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAAGG TTCTTTGGGC CTCCGTCCTA ATAATCGCTG CCGTCCTGAC AATCGCATTG 
TATCTAACAC TGCAGGCTCT GCCCTCCTCC CAGCTTCCCC CGCAGACGGC GTCGGCGCCT
TCGTTAACGT CGCTCACGAC GTCCTCCGCG ACCGCGACGC CCTCATCAAC GTCATCCACT
ACGTCTTCCG CTACTCCGCC GTCTGTGGCT TCTCCTCAAT TCCTCGTTAT CAAGGACGTA
GTGGGTAGAG AGGTCAGGAT AAGGGTGCCA GTGAAAAGAG CAGTGGCTAT GGGCCCCGGC
GCCTTGCGCC TGGTGGTGTA CCTCAACGCC ACGGATATAT TGGTGGGCAT TGAGGCAATG
GAGAAAAGGC CTCCTCAAGG GAGGGATTAC GGCTACGTTA TATGGGCCAA GAATTTAACG
AACCTGCCAA TCGTAGGCCA GGGAGGCCCC GACAGCCCCG TGAACTTCGA GGCTATAATG
GCCGTGAAGC CCGACGTCAT AATAATGACG CCTGTATTGG CGAATACGCC GGACGAGGTG
CAACAGAAGA CCGGCATACC CGTGGTGGTC GTGTCCTACG GCACGACGGG CTCCATCAAC
TTCACGGAGC TTTTCTACTC GCTGAAGGTC TTGGGCAAGG TTCTGGGCAG GGAGCAGAGA
GCTGAGCAGT TGATCGCCTA CATGAAGTCG CTTATAGCAG ACTTAAAAGC GCGCACAGCC
AACATCACTA ATAGGCCCAC TGTATACGTC GGCGCTGTAT CCTTCAAGGG AGGCCGGCCT
TTCACCAGCA CCCAAGCTGG ATTCCCGCCG TTAGTCTTCC TCAACACGCC CAACGTGGCC
GATAAATACG GCATAAAGCC GGGCGCCCAG ATAAGCTGGG AGGCCCTCCT CCAAGCCCAG
CCAGACGTCG TCTTCGTGGA TTTGGGCAAC TACATGACCG TCTTACAAGA CTACAACAGA
TCGAGGGATC TCTACTGCTC CCTAAAGGCG TTCAAGGAGG GCAGAGTATA CGGCATATTG
CCCTTCAACT TCTATTGGAC CAACATAGCC ACGATGTTCG CCGACGCCTA CTACATGGGC
AAAGTGCTCT ACCCAAACCG CTTCGCCGAT GTGGACCCAA TCGCCAAGGC TAATGAGATC
TACGAGGTGT TCCTAGGAAT GCCCTTATAT CATAAAATAG CCAAGGACTT CGGCGGGGGG
TTCAGGAAGC TGAGCTTCCC ATGCGGATAG
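
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any calculation. The sketch below shows one way a GC percentage like the one in the record could be recomputed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample input is only the first line of the sequence, so its result is not expected to match the whole-gene figure:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as sample input.
sample = "ATGGGCAAGG TTCTTTGGGC CTCCGTCCTA ATAATCGCTG CCGTCCTGAC AATCGCATTG"
seq = clean_sequence(sample)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

To check the full gene, paste all sequence lines into one string and pass it through the same two functions.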
 
Protein sequence
MGKVLWASVL IIAAVLTIAL YLTLQALPSS QLPPQTASAP SLTSLTTSSA TATPSSTSST 
TSSATPPSVA SPQFLVIKDV VGREVRIRVP VKRAVAMGPG ALRLVVYLNA TDILVGIEAM
EKRPPQGRDY GYVIWAKNLT NLPIVGQGGP DSPVNFEAIM AVKPDVIIMT PVLANTPDEV
QQKTGIPVVV VSYGTTGSIN FTELFYSLKV LGKVLGREQR AEQLIAYMKS LIADLKARTA
NITNRPTVYV GAVSFKGGRP FTSTQAGFPP LVFLNTPNVA DKYGIKPGAQ ISWEALLQAQ
PDVVFVDLGN YMTVLQDYNR SRDLYCSLKA FKEGRVYGIL PFNFYWTNIA TMFADAYYMG
KVLYPNRFAD VDPIAKANEI YEVFLGMPLY HKIAKDFGGG FRKLSFPCG
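
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below uses the standard codon assignments, which translation table 11 shares. Table 11 differs in its set of permitted start codons, which is not modeled here and does not matter for this gene because it starts with ATG. The function name `translate` is a hypothetical helper, and only the first and last fragments of the gene are used as input:

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene: matches the start of the protein sequence.
print(translate("ATGGGCAAGG TTCTTTGGGC CTCCGTCCTA"))  # MGKVLWASVL
# Last 30 bases of the gene: matches the end of the protein, then the TAG stop.
print(translate("TTCAGGAAGC TGAGCTTCCC ATGCGGATAG"))  # FRKLSFPCG
```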