Gene Pars_1168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1168 
Symbol 
ID5054211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1055509 
End bp1057059 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content57% 
IMG OID640468718 
Productextracellular solute-binding protein 
Protein accessionYP_001153391 
Protein GI145591389 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.731614 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0353149 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAACAG GCTTGTTGGT TACACTAGTT GCAATTGTCG TCGTCGCGAT TCTCGCCGTG 
TATCTCGCCA CACAGCCGCC GACGCAGCCA CAACCCACCA CCTCCCCAAC TACTACCTCT
ACTCCTCCCG CTACCACATC GCCGACGTCT CCGCCAACCT CTCCAACTAC TTTTCCGCCT
ACTCCGTCGC CTTCCACCAC TCAGTCACCG TCTCCGACGT CTACTTCGAC GCCTCCGCCT
ACTCAGCCGG CGCCTACTTG CGATAAGTTG GTGGTTTTGA CGAGGCACCC GACCGATATC
TTGGACGCCA CGCGCGACTT GTTCTTGAAG AGCGACGTGG CGAAGAAGTA CGGCATTAAG
GACGTGGTGT TTAGGCCTTT GGCCGCGGCG CAGTGGCGGC CGCTGATTGA GCGGGGCGAG
GCTGATGCGG CGTGGGGCGG AGGGCCCACC CTCTTTGACT CCTTGTATAA AGACGGACTA
CTCCTGCCTC TTGAAGGAGA TGAAGTAAAG GCCGCCATTG CCCAGATACC TAAAACCGTC
GCGGGGATGC CTATGATGCG CGTGGGGCCG GACGGCAAAG TGTACTGGGT GGCTTGGGCA
ATTTCCAGCT TCGGCATTAC GATCAACACT AAAGTGCTAA GGACAGCCGG CGTGCCTGAG
CCCAAGACGT GGACAGACTT GGCGTCTCTA GAATACGGCA AAGCCATATT AAAAGGGATG
CCAGTCACCG GCTTGGCGCA GTTGACAAAG TCGACGAGTA ATACCAGGAT TGCGGAGATT
ATTCTCCAGG CATATGGCTG GGACCAGGGC TGGGTGGTCA TCACGCTGAC GGCGGCCAAC
GGCAAGGTGT ATGGCGGAAG TGAGGCTGTA AGGGACGCGG TCATTGCCGG GGAGATCGGG
GCTGGGTGGA CTATTGACTT CTACGGCTAC ACGGCGCAGT TGCAAAACCC CGACACGAAG
TACGTAATTC CGCCGGATAC GTCGGTTAAT GGTGACCCCA TTGCCGTGGT TAAGAACACC
AAGTGCAGAG CTGCCGCTGA GGCCTTCGTG GCGTGGGTGA TTACAGAGGG CCAGGTGGTG
GTGTTCGACC CCAAGATTAA CAGAATGCCC GTCAACCCCA ACGCCTTCAA CACGCCTCAG
GGGAAGCAGA GGCCCGACCT AAAGAGCGTA TATGACCAGC TCTTCCAGCT TAAGACCATC
GAGTTCAACG ACACCCTAGC GCTTGCCGTG GAGAACGTTG TTATGTACTA CTTTGACGCG
GCGATCACCG ACAACATAGA CATCTTGCAA CAGACGTGGC TTAAGCTGGT AAAGGCTCTA
AACGACGGGA AGATTGACAG AACGAAGGCG GAGGCCTTGG CGCAGAGACT CGGCGAGCCG
GTGACCTTTG TAGACCCAGA CACGGGGCAG TCTGTCAAGC TGACGATGGA ATACGCCATG
AGGATAAACG ACAGGATTGG AACAGACTCG ACGTATAGAG ATAAGGTCTA TGCCGCATGG
AGAGACGCCG CGAGGAAGAA GTACCAGGAG GTAGCCTCCC AAATCCCCTA G
 
Protein sequence
MRTGLLVTLV AIVVVAILAV YLATQPPTQP QPTTSPTTTS TPPATTSPTS PPTSPTTFPP 
TPSPSTTQSP SPTSTSTPPP TQPAPTCDKL VVLTRHPTDI LDATRDLFLK SDVAKKYGIK
DVVFRPLAAA QWRPLIERGE ADAAWGGGPT LFDSLYKDGL LLPLEGDEVK AAIAQIPKTV
AGMPMMRVGP DGKVYWVAWA ISSFGITINT KVLRTAGVPE PKTWTDLASL EYGKAILKGM
PVTGLAQLTK STSNTRIAEI ILQAYGWDQG WVVITLTAAN GKVYGGSEAV RDAVIAGEIG
AGWTIDFYGY TAQLQNPDTK YVIPPDTSVN GDPIAVVKNT KCRAAAEAFV AWVITEGQVV
VFDPKINRMP VNPNAFNTPQ GKQRPDLKSV YDQLFQLKTI EFNDTLALAV ENVVMYYFDA
AITDNIDILQ QTWLKLVKAL NDGKIDRTKA EALAQRLGEP VTFVDPDTGQ SVKLTMEYAM
RINDRIGTDS TYRDKVYAAW RDAARKKYQE VASQIP