Gene Pars_0357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0357 
Symbol 
ID5054856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp307117 
End bp308244 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content46% 
IMG OID640467928 
Productglycosyl transferase, group 1 
Protein accessionYP_001152615 
Protein GI145590613 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTATAACC TAGCTAAGCA CCTCAGAATG CTTGGCGTAG ATGCCTATGC TATTGACTGC 
ATCAAAACGG CTGGTCATGA AAACCAGGAG GATTTCATAT TAAGGGTGAA GTGCGATAGC
AATTTGCATA AGGTGCTGTG TAGGGAGGCA AGCGCGAGGT GCAATGTATT AAAAGAGGTT
GTCTTTGCTA GACGGGCGAC GAAGATTTTA GACAAATTCG ATGTGGTTCA CGTAAATACT
GCGTGGGTTG GTTTTACACT AAGCCTATTG TTGAGAAGAC CCAGGTTTGT TTACACCTGC
CACAACCCCT TGTGGCCTGA AGACCAAGTG CATTTCGGCG AGAAAATCGT CAGGATAGTG
GAGGGCCACG CTATGAGGAG GGCTCATGCC GTCATAGCGC TGAATAACAC GATGAAGAGA
TCCATTGAGG CTAAGGCCAA GGTGGGCCCA TCAAAAATCT TTATTGTGCC AAATGGCGTG
GACACAGAGT TTTATAGGCC AAACCTACCC TGCGAACATG TTAACGAGGA GTATGGGCTT
GAGGGTAAGA AGGTTGTGCT ATTCGTAGGC AGAGTTACTT GGGGCAAGGG CGTACACATA
CTATTAAAGG CCATTAAACG CCTCAGAGAT TTCTACAATG TCAGAGATGT CAAGGCTTTG
ATCGTGGGCC CCCTCTCGGG CTTCTACAAA TCCGACAAAC CCTCGAGCTA TGCCCAGTTA
CTCATGAGCT ATGCTAAAGC CAACAACCTA GGTGTCGTCT TCACGGGCTC TATAGATTCA
GACATGCTCA GATACGTATA CTCGTGTTCA CACGTATTAG TACTACCTTC GTATTTCGAG
GCTTTTGGAA TGGTTCTTAT AGAGGCTATG GCCTCGGGGA TACCGGTGAT AGGCTCCAGA
GCCGGCGGGA TACCAGATAT TATAGAAGAG GGCGTAAATG GATTTACATT CCCTGTGGGA
GATGATGTCA CACTAGCAGA GAAGCTATAT ACACTTTTGA CAGATGAATC TTTACATAAG
AATATGGCTA ATGCTGCGAG AAGCATAGCG GTAACAAGGT ATAGCTGGAA AATTGTTGCA
AAAAAACTAT TGAAATTATA TGAGATTGAA AACTCCATCC AGTCATAA
 
Protein sequence
MYNLAKHLRM LGVDAYAIDC IKTAGHENQE DFILRVKCDS NLHKVLCREA SARCNVLKEV 
VFARRATKIL DKFDVVHVNT AWVGFTLSLL LRRPRFVYTC HNPLWPEDQV HFGEKIVRIV
EGHAMRRAHA VIALNNTMKR SIEAKAKVGP SKIFIVPNGV DTEFYRPNLP CEHVNEEYGL
EGKKVVLFVG RVTWGKGVHI LLKAIKRLRD FYNVRDVKAL IVGPLSGFYK SDKPSSYAQL
LMSYAKANNL GVVFTGSIDS DMLRYVYSCS HVLVLPSYFE AFGMVLIEAM ASGIPVIGSR
AGGIPDIIEE GVNGFTFPVG DDVTLAEKLY TLLTDESLHK NMANAARSIA VTRYSWKIVA
KKLLKLYEIE NSIQS