Gene Pars_2136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2136 
Symbol 
ID5055873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1910820 
End bp1912148 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content54% 
IMG OID640469688 
Productperiplasmic binding protein 
Protein accessionYP_001154334 
Protein GI145592332 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTTGT CAAAAGAGTT CCTGGTAGCG TCGGCGGCAT TTGCGCTTTC GATTATTGCT 
TTGGCAATAG CGGTACAGGC CTTGGGGCAG TTATCATCGT CGCTAATCTC TCTCAAGGCA
GATGTCACCG ACAAGTTAAA TATCCTGGAG AAAAAGGTGG GAGAGCTTCA GCAACAGTTA
AGCATATCCA CGAGCGAACT CAGAGGTTCT CTTCAGAGCG AAATCTCCGG CGTTAACAAA
ACTTTGTCTG AGCTTAGGCA GGAAGTGACG CTTCTTAGGC GTGTGGCGTC TATGCCCTCT
GGCCAAGTGT CTGTCAAATA CGCCAGCTTT TACCTGGCTT ACGAGGGCGG GGCCTACCTT
CTTAAGGACT CCATGGGACG GAGAGTTTTG CTACTGCCAC GGGGGATTGA AGCGCCTCTT
GCCTCGTATC TAGAAGCGAA GTATAGACCC GACCTCGTTG TGATTTACCC AGTAGAGAGG
GCTGTTTTTA TGGCCGCTAC CCAAGTTGCC ATGGTCTACC GGCTGTATAA CGAGACTGGG
GACCCGCGCT TCTTGAGGTC AATCGCCGGC ATTATGTGGG GCAGGGACTA CGAGTGGTAT
CTCCCCGAGG TTAAGGCCAT GCTCCAAAAC GGCACTATAA AAGACGTCGG ATCTGCCTAT
TCTCCAAACT ACGAGGCCAT ATTGGCCCTG AAGCCCGATG TTGTCTTCGT ATACTTCTCC
CCCGGCCCCT ACGGCACAGA GGCCGTAATT CAAAGGCTTC AACAGCTGGG CGTCCCCTAC
GTCGTCGTGA ATGAGTTCAA CGAGAGGAGC CCCCTGGGGA GGTTCGAATG GGTCAAGGCA
GTAGCCGCGT TTTTCAACGC GACGGACAAG GCAGTTGCGG TATTCAACAA GGTGGAGGCG
AGGTGGGACC AGCTGGCGTC CTTAGCCGCG GATTTAGACA GGCCGAGGGT GGCGTGGTTT
ATCATATACC AAGGCATTCT ATACCCCGCA GGTCCGGCGG TAAGGGAGCT AATAAGACTG
GCGGGGGGCA GATACGCCTA TGCCAACTAC AGCCGGGTTG ACCTCGAAGT CGTGTTGAAG
CACAGAAACG ACGTTGACGT GCTCATATGG TCGGGTTACG GCGTGTCAAA GATAGAGGAC
ATAGTGAAGA TAGAGCCGAG ACTCAAGGAA CTTAGGCCAG TCGTGACTGG CAGGGTGTAC
GCATACAGCC CCGCCTTTTA CCAGCTGTCC AACGCCTACC CAGAGCGCGT CCTTGAAGAG
CTCGTGTCGA TAATACACCC CGAGATCTCT CCGCCGGGTA GGCTAACGCT CTTCATCCAG
TTGCAGTGA
 
Protein sequence
MQLSKEFLVA SAAFALSIIA LAIAVQALGQ LSSSLISLKA DVTDKLNILE KKVGELQQQL 
SISTSELRGS LQSEISGVNK TLSELRQEVT LLRRVASMPS GQVSVKYASF YLAYEGGAYL
LKDSMGRRVL LLPRGIEAPL ASYLEAKYRP DLVVIYPVER AVFMAATQVA MVYRLYNETG
DPRFLRSIAG IMWGRDYEWY LPEVKAMLQN GTIKDVGSAY SPNYEAILAL KPDVVFVYFS
PGPYGTEAVI QRLQQLGVPY VVVNEFNERS PLGRFEWVKA VAAFFNATDK AVAVFNKVEA
RWDQLASLAA DLDRPRVAWF IIYQGILYPA GPAVRELIRL AGGRYAYANY SRVDLEVVLK
HRNDVDVLIW SGYGVSKIED IVKIEPRLKE LRPVVTGRVY AYSPAFYQLS NAYPERVLEE
LVSIIHPEIS PPGRLTLFIQ LQ