Gene Haur_4228 details

Gene Information

Locus tag: Haur_4228
Symbol:
ID: 5736082
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5389144
End bp: 5390625
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 53%
IMG OID: 641281383
Product: undecaprenyl-phosphate galactose phosphotransferase
Protein accession: YP_001546988
Protein GI: 159900741
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
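
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. All variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5389144
end_bp = 5390625
gene_length_bp = 1482
protein_length_aa = 493

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not
# translated, so the protein has one residue fewer than the
# number of codons.
computed_protein_length = computed_length // 3 - 1

print("computed gene length (bp):", computed_length)
print("length is a multiple of 3:", computed_length % 3 == 0)
print("computed protein length (aa):", computed_protein_length)
print("matches record:",
      computed_length == gene_length_bp
      and computed_protein_length == protein_length_aa)
```

Under these assumptions the computed values agree with the record: 1482 bp, or 494 codons, of which 493 encode amino acids.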


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00880688
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAGCG ATTGGTCTAC CAGCCAACCG ATTTTTTCGC AGCGCGATGT GCGTCAAACC 
ACATCGCGGC TGGCTCTGAC CTTGCTCGAT GGATGTTTGA TTTTGCTAGC CTTTGCCGTA
GCGCACTGGC TGCGTTACGA TGTCCGCTTA GGCCGCGATA TTTACGACCC AGCTTCATAT
CGCCAACTCT CGGCCTTCTA CCCGATGATG TTGGTGTTTA TGCTGACGCT GATTAGCACG
TTGCACTGGC GCGGGTTTTA TCGCCTGCCC CGTTCAGCCT CAGCCTTCGA TTCATTTAGT
ATTATCGTTA CCAGCACCAC GATTGCCCTT GCCCTCACGG TCATGTGGCT GTTTATCAAT
CGCGCCGATT TATGGTCGCG CTTGATTATG GTGTTTGTTT GGTTCTGCGT AATTGTGGCG
CTGACGCTGG GTCGGATTAG TTTGCGCATG CTGAGGCGTT GGGCTTGGCG ACGCGGCGTT
GGCTTAGAGC AAGTCGTGGT GGTTGGCAAT CGTGGCCTAG CCAAACAGGT GATGGAAGAA
TTGCAGTATA CGCTCGATCA TGGCCATCAT TTGTTGGGTT ATGTCGAAGG ACCGCCTGAT
GATCGCGACG ATGTAGCCGC ACCAGGTGAG CAATTTCGCT GGCTTGGCAC GCTCAGCCAA
TTCGAGCAGA TTATGCGTCA GCGCCATGTC GATCAAGTGA TTATTGCCTT GCCATTTTGG
GCGCACACCA GCCTACCCGA AGTTGTGGCA ATTTGCCGTA AGTTCAACAT TGAATTTCGC
GTCGCCCCCG ATTTATATGA ACTCAGCTTC GATCGCGTCA GCATTCAGCG CTTGAGCACG
ATTCCCCTGT TGCGCTTGAA AAAGAATGTG ATTCGCGGCT GGAATTATGT GTTCAAACGC
AGCACCGATT TGCTGATGAT TGGCCTGACT GCGCCGATTT GGGCCACAAT TTGGGGCTTG
GCCGCCTTGA TGATCAAACT TTCTGATCCA CAAGCGCCGG TAGTGTTTCG CCAACCACGC
ATCGGCAAGC ATGGCCAAAC ATTTATGGTC TACAAATTAC GCACGATGGT GCCCAATGCC
GAGGCGCTCA AAAAGAGCCT GATGGATCAA AATGAGGCCG AGGGAGCACT GTTCAAAATC
AAAGACGACC CACGAGTCAC CCGCTTAGGC CGGATTTTAC GCAAACTGAG CATCGATGAA
CTGCCCCAAC TCTACAATGT GCTACGCGGC GAAATGAGCC TGGTTGGCCC ACGCCCGCAA
GTGCCCGACG AAGTAGCCCA ATATCAAGAA TGGCACTATC GCCGTTTGGA AGTGACCCCA
GGTTTGACTG GTTTATGGCA AGCCTCAGGC CGCTCCAACA CCACCTTCGA TGATATGGTG
CGCTTGGATA TTTACTACAC CGAGCACTGG TCGCTCTGGC TTGATCTGCG GATTATGATC
ATGACGATTC CAGCGGTGCT ATTTGGTCGT GGCGCATACT AA
 
Protein sequence
MMSDWSTSQP IFSQRDVRQT TSRLALTLLD GCLILLAFAV AHWLRYDVRL GRDIYDPASY 
RQLSAFYPMM LVFMLTLIST LHWRGFYRLP RSASAFDSFS IIVTSTTIAL ALTVMWLFIN
RADLWSRLIM VFVWFCVIVA LTLGRISLRM LRRWAWRRGV GLEQVVVVGN RGLAKQVMEE
LQYTLDHGHH LLGYVEGPPD DRDDVAAPGE QFRWLGTLSQ FEQIMRQRHV DQVIIALPFW
AHTSLPEVVA ICRKFNIEFR VAPDLYELSF DRVSIQRLST IPLLRLKKNV IRGWNYVFKR
STDLLMIGLT APIWATIWGL AALMIKLSDP QAPVVFRQPR IGKHGQTFMV YKLRTMVPNA
EALKKSLMDQ NEAEGALFKI KDDPRVTRLG RILRKLSIDE LPQLYNVLRG EMSLVGPRPQ
VPDEVAQYQE WHYRRLEVTP GLTGLWQASG RSNTTFDDMV RLDIYYTEHW SLWLDLRIMI
MTIPAVLFGR GAY