Gene Pars_1764 details

Gene Information

Locus tag: Pars_1764
Symbol:
ID: 5055674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1582955
End bp: 1585330
Gene Length: 2376 bp
Protein Length: 791 aa
Translation table: 11
GC content: 48%
IMG OID: 640469309
Product: hypothetical protein
Protein accession: YP_001153967
Protein GI: 145591965
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1361] S-layer domain
TIGRFAM ID:
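
The lengths listed above follow from the coordinates. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon (the record states the gene is not spliced). The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1582955, 1585330)
print(length, protein_length_aa(length))  # 2376 791
```

Both values agree with the Gene Length and Protein Length fields of the record.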


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0673103
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.000629319
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAAACCGA TATATCGGTT AATATTAATA ATAGCCTTTG CAATGATAGC GAATGCCCAG 
GTAGCTACGC CTAGTATGCA GCAGTCTTCT TTTAACGTAG GCGTATCGTA CTCCCCGCCG
TATATATATC CCGGCTCTAT TGTCCAACTA TATATAACAG TTATATCTAT ACAACCTGTG
TCTAACTTAT ACATAGATAT AAATTCCCCT TTTAAAGTAT TAACAGGCTC TACAATTCAA
ATAAAACAAA TATCACCCGG CACGCCGGCA ACGTTCACAG CAGTTATTCA GGTACCCCTC
GACGCGCGGC CTGGATACTA CACTATTAAG GTAGTAGCGT ATACTCTTAT GTACTCGGCA
GAGTCAAGCA TCGATATTGA AGTGCTGCCC TTTGACTTCT CTTCTCTTGT CGTGGCAAGG
CCTATGGCTT ATCTACCTGG CCAGCCTGTT CAGCTACCCG TTTTGCTATT TAACCCCACG
GCTGACTACA TAAGAGCCCG CGTGGCAATA AATGGGTCTG CCGTTTCACA ATACCTAAAC
TCTTCGCTTT CGTGCGATAC TCTTATACCT CCTAGGTCCA ACTCTACGTG TTTTCTAAAC
TTTATAACAC CTAGCGGGTT GAAGCCGGGG TTCTACAACG CTACCTTAAC AGTTACTTTG
AGTAGCATAT CGGGGTACGC CGGTGCTGTA ACTTTTAGAA AGACGGTACA ACTGCCGATC
ATCAGCGGCG TTGACGTGAA CGTGATGGCG TCGCCCACGC AGCCTGTAGT TGTAGGCTCC
CCCACATTGG TGACGTTGGC GATATCTCAG GGAAGTTCAG TGCCTTTGCA AAACGTCACT
ATTCGAGTCC TAAGAGGCGA TGGTGTAGAG GTCTTAAACT CCAACCCAGT TACCGTACCT
TTGCTCACGC AAATACAGCT ACCCGTCCAG CTTGTAGTAA GTAAGTACGG GAATGTGAAG
ATACCTGTGG AGATATGTCA CTACTCGTCC TGCGGTATTA AATACGCCGA GTTGTATGTG
CCAATCCCAG GGCTTTCCGT AAATGCTGTT TTTACCCCGT TTCAAGGATA TCCAAGCTCG
TTGGTGCAGT CTACGTTCAT CATAGCGACG AACTACACGA TATCCAATGT CGCCGTGGAA
ATCCGCGCCC CATTCAAAGT GTTGACAAGT CCCGCTGTTT CGTTGCCGTT TCTGTCTCCC
CAGTCTCCCG CCACTATAAA CGTGGTTTTT GAAATTCCTA AAGAGACCAA GCCTGGGGTT
TACCCCGTCA ATGTGACAGT AGCCGGCGCC GTGTATAAGT TTTACTACGA AGTATTGAAA
CCTGAATTTA GCGTCGTGGC GACGTTTAAC CCGCCGGTAG CGTATCCAGG TGCCGTTGTT
TCCGGCAACC TGGTCGTGAC TTCTCCTTTT AATGCAAAAG ACCTAGATAT ATCCATATCG
ACTCCTCTAT CTCTCATATC TCCGGCAGAG TACCGCTTAC CCTATCTGCC GCAGGGACAG
CCATACACGG CTTATATAGT CCTGCAAGTG CCTGAGGGAA CGCCGCCTGG GAAGTACCCC
CTTACGGTGA CTGTAAATGG GGAGAACTAC ACGTTCTACG TCAACGTCGG CGCGCCAAGT
GTGGTAATTC AAAACGTAGT AATTACCCCT CCTAAAATAC TTGAAGGCAT TGCTACGGCG
CAAGTGGCTG TACAAGTCCT AAATACGGGC CCCGTCGTTG CGCGGAATGT CACAATTACA
TTGTTAAACT CTACTATTGG GAAGCAGAGC TATACGTTGG ATTACCTCCC GCCCGGATCG
CCGATCACTT TGACGTACTA TGTCGATATT TCAAAACTCC CACCTGGCCA TTACGACGTT
GTTGCGCAAG CTAGTTGGAG CGGCGGCGTT TACGTAGCTA GGGGGTCTCT AGATGTGGCT
AAGAAAAACG CCCTTAAAGT GGACTACAAG GTATACAACG TCGCGCCAGG CTCAACAGCG
GTTTTAGTAC TTAACATAAC GAATCTAGGC CCCGACGATG TTAAGAATTT AAGAGTGTCG
TTTACCCCCT CGCAAGTATT TGAACTACAC GCCTCTAATA TCGCAGACGT GGCGACTGCC
GGCGTTAGAA TGCTGGGAGA CCTGGCGCCG GGAGCCTCTG TCAGTACTGC GTTTTTACTA
GATGTGTCCG ATAAGGCGGT TCCCGGGATC TACGCTATAA CGCTGGTGGC TACGTGGAAC
CAGACAGGCG TCTTTATGCC CAGCGTGCAG TATATAAACA TCCCTATTGA AGTGAAAAGC
GGCATCGACA TGTTTGTCGT TATACCGCTT GTCTTGACCA TATTGCTCAT CGTAGTGGGG
CTGGTTACTG TAGCGCGTAG AAGGCGCCGT GGATAG
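
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any calculation. The following is a minimal sketch of how the GC content field (reported as 48% in this record) could be recomputed from the listing; the helper names are hypothetical, and the example call uses only the first line of the sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First line of the gene sequence, copied from the listing (60 bases).
first_line = "ATGAAACCGA TATATCGGTT AATATTAATA ATAGCCTTTG CAATGATAGC GAATGCCCAG "
print(f"{gc_percent(clean_sequence(first_line)):.1f}% GC in the first 60 bases")
```

Passing the full listing (all lines joined) to the same functions gives the value for the whole gene.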
 
Protein sequence
MKPIYRLILI IAFAMIANAQ VATPSMQQSS FNVGVSYSPP YIYPGSIVQL YITVISIQPV 
SNLYIDINSP FKVLTGSTIQ IKQISPGTPA TFTAVIQVPL DARPGYYTIK VVAYTLMYSA
ESSIDIEVLP FDFSSLVVAR PMAYLPGQPV QLPVLLFNPT ADYIRARVAI NGSAVSQYLN
SSLSCDTLIP PRSNSTCFLN FITPSGLKPG FYNATLTVTL SSISGYAGAV TFRKTVQLPI
ISGVDVNVMA SPTQPVVVGS PTLVTLAISQ GSSVPLQNVT IRVLRGDGVE VLNSNPVTVP
LLTQIQLPVQ LVVSKYGNVK IPVEICHYSS CGIKYAELYV PIPGLSVNAV FTPFQGYPSS
LVQSTFIIAT NYTISNVAVE IRAPFKVLTS PAVSLPFLSP QSPATINVVF EIPKETKPGV
YPVNVTVAGA VYKFYYEVLK PEFSVVATFN PPVAYPGAVV SGNLVVTSPF NAKDLDISIS
TPLSLISPAE YRLPYLPQGQ PYTAYIVLQV PEGTPPGKYP LTVTVNGENY TFYVNVGAPS
VVIQNVVITP PKILEGIATA QVAVQVLNTG PVVARNVTIT LLNSTIGKQS YTLDYLPPGS
PITLTYYVDI SKLPPGHYDV VAQASWSGGV YVARGSLDVA KKNALKVDYK VYNVAPGSTA
VLVLNITNLG PDDVKNLRVS FTPSQVFELH ASNIADVATA GVRMLGDLAP GASVSTAFLL
DVSDKAVPGI YAITLVATWN QTGVFMPSVQ YINIPIEVKS GIDMFVVIPL VLTILLIVVG
LVTVARRRRR G
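
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own method. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in their permitted start codons), and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop.

    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        residues.append(residue)
    return "".join(residues)


# First two blocks of the gene sequence; the trailing partial codon is ignored.
print(translate("ATGAAACCGA TATATCGGTT"))  # MKPIYR
```

The output matches the first six residues of the protein sequence above (MKPIYR).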