Gene Pars_0947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0947 
Symbol 
ID5056096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp835510 
End bp837579 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content57% 
IMG OID640468503 
ProductCBS domain-containing protein 
Protein accessionYP_001153179 
Protein GI145591177 
COG category[R] General function prediction only 
COG ID[COG0517] FOG: CBS domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTGC AAGTAAGCTT TAGGCAAGTC GCCGGCGTCG CCGCCGTTGC GTGGTCCGGA 
GCGTTTTTAG AATGGGTCGA CTTCTACACA TATGCGCTGC TGGCGGGAAC CGTAGCTAAG
GTCTATTTCC CGTCGAAAGA CCCCATAGCC TCTCTACTTG CGTCATTCGC CGCATTGGCG
ATAGGATTTT TGTTTAGGCC TCTTGGGGCG ATTCTTTTCG GCAAAATCGG CGACCAGTTC
GGCAGAAAAG TTGCCTTTAT CACAGCGATG TCACTCATGC TGGCGGGCAC TCTCGGCATA
GGCCTGTTGC CGGGCTACGC CGAGATAGGG GTCTTGGCTT CCGTAGGCGT GTTCCTCCTC
AGAATAGTCC AGGGCCTAGC ACTGGGCGGG GGCTACGGCG CGGCCATCAC CTACCTGGGA
GAATTTGTGC CGGAACACCG CAGGGGGCTC TTCACCGGGT TCTTGTTCAC CACGCCGGCC
GCGGGCATGG CGACAGTCGG CGCTCTAATA TGGCTCTTCT CCACGATGCT CGGCAAGCAG
GCCTACGAAG CTTGGGGCTG GCGCCTTAAC TTCATCGTGG CCGGTATCGT TGTGTTTGTC
GTGGTCTTGG TAATGCATCT CTTCTACAAG GAGACGCCTG TGTTCTCCAT GTTGAAAGCA
GTGCGGAGGG TCACCTCTGC GCCTATAAGA GAGGTGTTCT CAGCCCGCTA TCTGCCTCTC
GTGTTGCTTG CGTGGATAGG CGTCGTCGGT GCCCACGGCC CAATTTGGTA CACCAACCAG
CTATTCAACA CCTACTACAT CGGCCCCAAC TTCCGGAACT ATGTAGACGC GGCCACTGCC
AGCGCCCTAT TATCCACGGC CACCTACGCC GCTCTTTGGA CTTACCCGCT CTTCGGCTAC
TTATCGGATA AGATCGGGAG GAAGCCGATT CTACTCTTGG GTATCTACGG CAACGCGCTG
TGGTTCCCCA TAGCCTTCTG GTTAATTGAC CAGGTGGGGC CGCAGAAAGA CCTAACAGCT
ATGTGGCTGT TGTTCTGGTC CATGACCCTG TTCAACGGGA TCGGCTACAG CGGCGCCATG
TCGGCGTTTC TCCTAGAGCT ATTCCCAGCG AGGATTAGGC TTTCCGCCGT GTCTCTGGCC
TACAACCTCG GCTACGGCAT AACCGGAGGG CTGACGCCGT TTGTAATTAC GTGGCTATAT
TCAGCTACTA AAAACATCTA CATCTCCACC CTAATGTACT CGACGGTGGT GCCGATGGTA
ATGGCCTTGT GGTATCTGTT AAGGGGTCCA GAGACGCTTG GTACTAGGAT CTGGGCCGAG
TTCGCCGCCG AGAAGTTCGC CAAGAAGACA GTCACCCTGC CGGCGACCGC CCCTATTAGG
GAGGTTGTAT CTGCCTTAGC CTCTACTGGG AGCAAATACG CCGTGTTAGT AGGTAGCGTT
GCTGGTATCT TCGGCACGCG TTGCTTGATT AGGGCGCTGA GCGCCGGCGC GAAGATGGAA
GAGCCGGCAG TTAACTACGC CGTGAAGGTG CCGTGTATAC AAGCTGATCA CCCGGTGACT
GAGGTCTTCG TCGCGTTGGA GCAGTACAAC GTCAGGGCCG TGCCGATTTG CAAAGGGAGC
GAGGTGGTGG GCATAGTGGA GGCTAGGGAG CTGATAAACG AGGCCCTGGG GCTGAAGAGC
GCGTTCAAGA AGAAGGTTGC TCTGCGCTTC TCCGTGGCCG ACGCTGCGCC GAGAGAGCTC
ATGACCATAA GCCCTGAGAC CACGTTGAAG GAGGCCGTGG ATCTCATGGC GAAGAACAAC
ATTGGCTTCC TGCCCATCGT CAGCGGCGGC AAGCTGGTGG GCGTGCTTTC AGAAAGCGAT
GTGTTAAAAC TTGCGACGAG GGGGATTGAC TTGTCGGCGC CGGTGGCCAC GGTTATGAAC
TCAAAACCCA TAACAATAGG TAAAGACGCC ACGTTGAGAG ACGCCGCCGA GCTGATGGTG
AAGCACAACA TTAGGCACCT CCCCGTGGTG GACGGCGACA AGGTCGTCGC GGTAGTGTCT
GTAAAAGACG TCGTAAGAGT TATCGGATAG
 
Protein sequence
MSVQVSFRQV AGVAAVAWSG AFLEWVDFYT YALLAGTVAK VYFPSKDPIA SLLASFAALA 
IGFLFRPLGA ILFGKIGDQF GRKVAFITAM SLMLAGTLGI GLLPGYAEIG VLASVGVFLL
RIVQGLALGG GYGAAITYLG EFVPEHRRGL FTGFLFTTPA AGMATVGALI WLFSTMLGKQ
AYEAWGWRLN FIVAGIVVFV VVLVMHLFYK ETPVFSMLKA VRRVTSAPIR EVFSARYLPL
VLLAWIGVVG AHGPIWYTNQ LFNTYYIGPN FRNYVDAATA SALLSTATYA ALWTYPLFGY
LSDKIGRKPI LLLGIYGNAL WFPIAFWLID QVGPQKDLTA MWLLFWSMTL FNGIGYSGAM
SAFLLELFPA RIRLSAVSLA YNLGYGITGG LTPFVITWLY SATKNIYIST LMYSTVVPMV
MALWYLLRGP ETLGTRIWAE FAAEKFAKKT VTLPATAPIR EVVSALASTG SKYAVLVGSV
AGIFGTRCLI RALSAGAKME EPAVNYAVKV PCIQADHPVT EVFVALEQYN VRAVPICKGS
EVVGIVEARE LINEALGLKS AFKKKVALRF SVADAAPREL MTISPETTLK EAVDLMAKNN
IGFLPIVSGG KLVGVLSESD VLKLATRGID LSAPVATVMN SKPITIGKDA TLRDAAELMV
KHNIRHLPVV DGDKVVAVVS VKDVVRVIG