Gene Pars_1059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1059 
Symbol 
ID5056257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp942836 
End bp944041 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content58% 
IMG OID640468615 
Producthypothetical protein 
Protein accessionYP_001153289 
Protein GI145591287 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACCTG AAATTCTGCG GCTGGCGTGG CAGGCTTTGT GGGAAAGGAA GGGCCGCACC 
ATAGGGGCTG TGGTGGGGGT GGTCATCGCC TTCACCGCTT TGAGCTACGC CCTTCTGCTA
GGCCAGACGT TTAAAGACAA CGTTGCTCAT TACTTCACTT CCAACTTCCA GATGAACGTG
CTCTACGTGA TGGGGTCTCA GTTCACAGAT GCAGATGTCA GCACCATATC AACAATAAGC
GGGGTGGAGC TGGCTGTGCC CATAACCTCG GCGAGGGGTG CAGTGAGGGT GCCCGGCACT
TCGGGCCAGA CCCCCGTGAC CGTCTACGGC GTGCCCCCCT CTCTTATTTC CCAAGTTCTG
CCCCCCACGT CGCTGTACGA CGGGGAGCTG ATTGTAGGGT CAAACCTCGC CATGGTGGGG
TACTACGTGG CCTTTGACCG TTCTACTGGC CAGCAGAGGG TCGCAGTCGG GTCACCGCTA
TCCCTCGCTA TCGGCAGGAG GTCAACCACT GTGGTAGCCT CGGGCATAAT GGCCACTGGT
GCCTTGGGCT TTGTAGACAC TACGCGGGGG GTGGTGATGG ACATAAACAC CTTCCGCCAG
CTTACCGGCA TCACCACCTA CAATCTCGTG ATGGTGTACC TAAAGGACGT ATCCCAGATA
GACGCCGTTT CAAACGAAAT CAAGGCCAAC TTCCCCAACG TAGACGTGGT GTCGCCCCAG
GCCATCCTCC AGACAATAAA CAGCTTCCTA ACCGCCTTCC AGCTCTTCCT CGGCCTCATC
GCCGGGGTCA GCACCGTGAT CACCGCCCTT TGGCTATACG ACACCATGTC CATCAGCGTC
GTGCAGAGGA CAAAGGAGAT AGGGATACTG AGAGCCCTGG GCTTTAGGAA GATGGACGTA
ATGGCCATGT TCCTCGCCGA AGCCTTCATA ATAGCGGCTA TAGGAGTATT AGTAGGTCTC
CTCCTCATAA TTCCACTGTC CCAGATGGGG CTACCGCTGT TAGGGGGAAT GCAACAACAG
TCCATGTCGG CTGGCGGCGC CTTTAGGCCG CCCCAAGGGG GCTTTAACAT ATCGTCGCTT
GTGCTAGACC CCGTGGTCTT GGCCGCTACC GCGGCGCTCG TGGTGGCGAT AAACCTAGTC
GGTGCCCTCC TCCCAGCCTA CAGGGCAGGG AGACTCGACG TCGTGTCGGC GCTTAGGTAC
GAATAG
 
Protein sequence
MLPEILRLAW QALWERKGRT IGAVVGVVIA FTALSYALLL GQTFKDNVAH YFTSNFQMNV 
LYVMGSQFTD ADVSTISTIS GVELAVPITS ARGAVRVPGT SGQTPVTVYG VPPSLISQVL
PPTSLYDGEL IVGSNLAMVG YYVAFDRSTG QQRVAVGSPL SLAIGRRSTT VVASGIMATG
ALGFVDTTRG VVMDINTFRQ LTGITTYNLV MVYLKDVSQI DAVSNEIKAN FPNVDVVSPQ
AILQTINSFL TAFQLFLGLI AGVSTVITAL WLYDTMSISV VQRTKEIGIL RALGFRKMDV
MAMFLAEAFI IAAIGVLVGL LLIIPLSQMG LPLLGGMQQQ SMSAGGAFRP PQGGFNISSL
VLDPVVLAAT AALVVAINLV GALLPAYRAG RLDVVSALRY E