Gene Pcal_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPcal_2037 
Symbol 
ID4909580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum calidifontis JCM 11548 
KingdomArchaea 
Replicon accessionNC_009073 
Strand
Start bp1891006 
End bp1892196 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content64% 
IMG OID640125790 
Productglycosyl transferase, group 1 
Protein accessionYP_001056918 
Protein GI126460640 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000383796 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTCTTG TGGCACACCA CTACTGGGGC TCCCCCGGCG GGGGCCAGTT GGTCTGCGCC 
GCGGCGGCCT ACGCTCTGGA CAAGGCGGGG TTGGCGCCCG TGCTCAGCGG CACGTTTAAG
TTTGACCCGG GGAAGTACGT GGAGTGGTAC GGCATAGACA TCTCCAAGTA CCCCCGCGTA
ACTCTGCCAG TGGGCGCCAA GGCCTTCGGC CTTTGGGCCA GGCTCCTGGT GTGGTTGCCG
GCCAAGAGGG CCGTTGAGAA GTACAGGCCG CGGCTCATTT TCACAGACGA GGTGGCGTAT
AAGCCCATCG CCGGCGCGGC GCCCCTGGTG GAGTACATAC ACTTCCCCTT CGAGGTGTTC
ATAGACCCCC GCTTCAGGGG CACCGGCCTG GCCTATGGGG AGGATCCCTA CATAACAGAG
CGCTACTCCC GCTTCCCGCT GAGCCTCTAC TGGCGCATCT ACGTCAAGCT GTTGCCAAGG
TACGCCAGGG AGAACCCCTT CCACTACGCC AGCCTAGTCC TCGCCAACTC AAGCTGGACC
GCCGACGTGG CCAAGGAGGT ATATGGGGAG AGGCCAACCG TCCTCAACCC CCCAATTGCG
CCCAACGTAG AGGTGGTGGA GTCGCCTAGG CCCTTCGAGG AGAGGGAGCC CGCCGTGGTT
ATGCTGGGCC GCTTCTCGCA GGAGAAGCGC TACCACTGGG CCGTCACAGA GGTGGCGCCG
CGCCTCGTGA AAGAGGTGCC GGGCGCAATG CTGTACATCT TCGGCGGCGC CGCCACGCCC
ACGCTGAGGG CCTACATGGA GGAGGTGAAG AGGCTGGCTG AGAAAAGCGG CGTGGCACAC
GCCGTCCGCC TAATCCCCAA TGCCCCGAGG CGGGAGATAA ACGCCACCAT GGACAGGGCC
AGGGCCTTCT TCCACGCCAC GATAAACGAG CACTGGGGGA TAGCCGTGGC CGAGGCCATG
GCCAGGGGGC TACCCCCCGT GGTCCACAAA AGCGGAGGCA CGTGGAGCGA CTTGGCCCAG
GGGGCCGGGC TGGGCTACGC AAGCGCTGAG GAGGCAGTGG AGCAGTTGGC CAAGTTCCTC
ACAGACCCCA AGGCCTGGAA AGCCGCGTCC GCCGCCTCCG TCGCCAAGGC AAAGGGTCTA
ACACTAGACG TCTTTGCCAA AAAGCTGGCC GACTTAGTGT CGGCGATTTA A
 
Protein sequence
MSLVAHHYWG SPGGGQLVCA AAAYALDKAG LAPVLSGTFK FDPGKYVEWY GIDISKYPRV 
TLPVGAKAFG LWARLLVWLP AKRAVEKYRP RLIFTDEVAY KPIAGAAPLV EYIHFPFEVF
IDPRFRGTGL AYGEDPYITE RYSRFPLSLY WRIYVKLLPR YARENPFHYA SLVLANSSWT
ADVAKEVYGE RPTVLNPPIA PNVEVVESPR PFEEREPAVV MLGRFSQEKR YHWAVTEVAP
RLVKEVPGAM LYIFGGAATP TLRAYMEEVK RLAEKSGVAH AVRLIPNAPR REINATMDRA
RAFFHATINE HWGIAVAEAM ARGLPPVVHK SGGTWSDLAQ GAGLGYASAE EAVEQLAKFL
TDPKAWKAAS AASVAKAKGL TLDVFAKKLA DLVSAI