Gene Pars_0708 details


Gene Information

Locus tag: Pars_0708
Symbol:
ID: 5055274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 630706
End bp: 632247
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 61%
IMG OID: 640468265
Product: glycosyl transferase family protein
Protein accession: YP_001152946
Protein GI: 145590944
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
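
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 630706
END_BP = 632247
GENE_LENGTH_BP = 1542
PROTEIN_LENGTH_AA = 513


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


print(span_length(START_BP, END_BP))   # 1542
print(protein_length(GENE_LENGTH_BP))  # 513
```

Both results agree with the Gene Length and Protein Length fields.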


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.43114
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.211707
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTCAACA TCACCCTGCC GAATATCACC ATCCCGCAGA TATCGCCAGA CGTCCTCAAC 
GACACACTGC GAATCCTCAA CAAGACGCGC CAAGGCCCCG TGACGGTGCC AAACGCCCCC
GCCATCCCGC CCTGGCTTGA AAACGCGTTG CTGACAATAT ACCTCGCCCT ACTGGCCTTG
TCCATACTGC TAATAGCCCA CTACATCTAC TACGCCCGGC ACTCCAGCCG CCAGCCCCCC
GACCTCCGCG CCGACCCCGC CGACGCCCCA CTCTCCATAA TAATTCCGGT AAAAAACGAA
AGCCCCGAGA CCGTTGCCCA AGCCGTAAAG AGACTGGCGG CGCTTAACTG CCCCGACGCG
GAGATCATAG TGGTGTCAGA CGACCCCCCA GACGCCTTCG AGGAGATTAG AAAGGCGGTG
GAGAGCCTCG CAGTCCCCAA CGCCAAGGCG CTGAGGAGGC CCCAGCCCGT GGGCTACAAG
GGAGTTGCGC TGAACTGGGC CGCGGAGCGG GCCAGGGGGG AGATCCTCCT CTTCCTCGAC
GTAGACAGCG TGCCGCCCCC CGACCTATGC CACAGGGCCA GGGCGGTGGG GGAGAGGGAG
ATCCTATTCC TCGGCTGGGA CGGCTACGCC CCCGTCAAAA CCCCCATAGC AACCCTCCAG
CTCTTCCTCT ACAAGTATTT GCTATTTCAC GTAGCGATAG TGGGACGCCA CAATACGGGA
CACCCAATCT TCGCCTTGGG GTCGGGGATA GCGGTGAGGA AGAAGTTCCT CCAGGAGATG
GGAGGCTTCT GCAACTGCAC CGCCGACGAC TACGACATAT CCATGAAGGC GTATCTACAC
GGAGGCAGAG TGGTATACTC GCCGGGGGTT CCAGTATATG TCGAGGTGCC TGGCGGCTAT
GCCGCGTTCA AGAAGCAGTA CGCCCGCTGG ACCTACAACT CGGCGTACTT ATTGGCGGCG
TACGGGCTGA AGATATTTAG GCTGTGGATG CCCCTCCCCC ACAGAATAAG CGTCTTCCTC
AACGTGGCCA CCCACCCCCT CATGATAATG ACAACATTCG CCATAATGCT CTCGGGACTG
GCAATGGGCT ACATGGGCAT CCTCCTGCCG CCCCTCCACA TACTCATACT ACAGCTGGCC
CTGGGCATAG CGGCGCTGGT CCAGGTATAC TACGTCTACA AGCTGGCGAG GCGCGACGGG
CACAGCTTCG TTGCGGTGGC GGGGAAGCTT GCCAAGTCAG CCGCCCTCCT CCTAGTCCTC
AGCCCCTACC TAGCCTTCTA CGTGGTCCTC GGCACACTCC GGAGGAGAAT AAGGTGGCAC
ATCACCCCCA AAGGCCTCGC CTCCGTCTTG TCGGGCCGAG CCGGGCCGTA CGAGATAGGC
CTCGCCGCGG CCCTAGGCGC CCTCTTCGCC TACGCCCTAA CCACCGCAAA CCCAGTCTTC
ATAACAAACA CCGCCTTCCT CCTAGCCGCC ACCCTCTACG TACTTACAAA AATAACTAGC
CCTCCGCCAA GGAGCGACGC GACACAATCG GGTACGGGAT AA
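
The GC content field reported for this gene (61%) can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here, in blocks of ten bases separated by spaces and line breaks; `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in cleaned)
    return 100.0 * gc_count / len(cleaned)


# First ten-base block of the gene sequence, used only as a small demonstration.
print(gc_content("GTGGTCAACA"))  # 50.0
```

Passing the full gene sequence to the function gives a value that can be compared with the reported percentage.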
 
Protein sequence
MVNITLPNIT IPQISPDVLN DTLRILNKTR QGPVTVPNAP AIPPWLENAL LTIYLALLAL 
SILLIAHYIY YARHSSRQPP DLRADPADAP LSIIIPVKNE SPETVAQAVK RLAALNCPDA
EIIVVSDDPP DAFEEIRKAV ESLAVPNAKA LRRPQPVGYK GVALNWAAER ARGEILLFLD
VDSVPPPDLC HRARAVGERE ILFLGWDGYA PVKTPIATLQ LFLYKYLLFH VAIVGRHNTG
HPIFALGSGI AVRKKFLQEM GGFCNCTADD YDISMKAYLH GGRVVYSPGV PVYVEVPGGY
AAFKKQYARW TYNSAYLLAA YGLKIFRLWM PLPHRISVFL NVATHPLMIM TTFAIMLSGL
AMGYMGILLP PLHILILQLA LGIAALVQVY YVYKLARRDG HSFVAVAGKL AKSAALLLVL
SPYLAFYVVL GTLRRRIRWH ITPKGLASVL SGRAGPYEIG LAAALGALFA YALTTANPVF
ITNTAFLLAA TLYVLTKITS PPPRSDATQS GTG
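
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below builds the standard codon-to-residue mapping, which translation table 11 shares for internal codons, and assumes that the first codon of a CDS is read as methionine even when it is an alternative start codon such as GTG. `translate_cds` and its `first_is_start` parameter are illustrative names, not an existing library API.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str, first_is_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        if i == 0 and first_is_start:
            residue = "M"  # assumption: the initiator codon is read as Met
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene: matches the start of the protein sequence.
print(translate_cds("GTGGTCAACA TCACCCTGCC GAATATCACC"))  # MVNITLPNIT

# Last 42 bases of the gene (in frame): matches the end of the protein sequence.
print(translate_cds("CCTCCGCCAA GGAGCGACGC GACACAATCG GGTACGGGAT AA",
                    first_is_start=False))  # PPPRSDATQSGTG
```

The final codon, TAA, is a stop codon, which is why the 1542 bp gene yields 513 residues instead of 514.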