Gene Acel_1939 details


Gene Information

Locus tag: Acel_1939
Symbol:
ID: 4486358
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 2196923
End bp: 2198413
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 65%
IMG OID: 639730730
Product: undecaprenyl-phosphate galactose phosphotransferase
Protein accession: YP_873697
Protein GI: 117929146
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
            [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
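
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following Python sketch assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon. Both are assumptions inferred from the numbers in this record, not something the record states. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2196923
end_bp = 2198413

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1491 496
```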


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTCGTTC TTGATGAGGT CTCGACGCTT CCGCGTCGGA CCGATCGGGT GTTGCGTACA 
ACGCGGAGCC GGCGTGCCGT CCGGGACCGG CTCAGCTTCC TGCGTCGCTA TGGCCGGGTG
CTTGCCGTAG CGGAAGGGGT CATCGCGGCT CTTGCGGTTC TCGTGTATTC ATCCGGGTAC
GCCGGCCGGG TCGATCGAGC AGCCTGGCTC CAGGTCGTCG CGCTTGGCTT GGGCTGGCCG
GCGACCTTGG CGTTGCTGCG CGCGTACGAA CCCCGCTTTC TCGGGCTGGG GTCTGAGGAG
TATCGCCGCG TCGTGCATGC GGGACTCGGC CTCACCGCGT GCATTGCGAC CGCCGGGTAT
GCGACGTCCT CGCTCGGCGG CCGCGGCGTC GCTCTCTTTG CCGTGCCCGG CGCGATGACC
GCGACCCTCG TGACGCGTTA CGGTGCGCGA AAGTGGCTGC ATTTCCAGCG ACGGCAAGGC
CGCCACCTGC AACGGGTGCT GATTGTCGGA CACGATGCGA CTGCGGCTGA GCTTGCGGAG
GCGATGCGGC GGGAAGCGTA CGCGGGTCTC TTCGTCGTCG GTGCCTGCGT TCCCGGTGGG
AAGGCCGGTT CGCACCATCG GCTGGACGCC GCCGGTGTAC CGGTCATCGA CGATCTGGAG
TCGGTGACGC GGGCGGTCGT CGCGGTAGAC GCCGCGGCCG TCGCCGTCCT GCCGTGCCCG
GAATTGTGCG GACCCAAGCT CCGAAAACTC GGATGGGATC TCGAAGCGGC AGGAGTCGAT
CTCATCGTCG CTCCCACCAT CGTGGATGTC ACGGGTCCAC GCATTCACAT CCGGCCGCTC
GCGGGCTTGC CGCTCCTGCA CGTGGAAGCC CCCGAATTTC ACGGATTCCG CCGTGTCTTG
AAAGAAGCGT TCGACCGGTT CGCGGCGGCG ATCGCCCTTA TCGTTCTCAG CCCGCTTCTC
CTCGCCGTCG CGATTGCGGT GGTTGCAACG AGCGACGGCG GTGCATTCTT CTGCCAGCAG
CGGGTCGGCA AGGGCGGCAA GTCTTTCCGG ATGTACAAAT TCCGGTCCAT GTACGCCGAC
GCTGAGCACC GGCTCACCGA GTTGCTGGAC AAGAACAAGC ATGGTGCTAC TGGTGTGCTG
TTCAAACTCG TCGACGATCC GCGGGTGACG CCGGTCGGCC GATTCCTCCG GCGGTATTCC
CTCGATGAAT TGCCGCAATT GGTCAACGTT CTTCTCGGTC ATATGTCGCT CGTCGGTCCT
CGGCCGCCGT TGGCCCGCGA AGTCGCCATG TATGGGCCGG AAGCGAAACG CCGCCTCCTC
GTCAAGCCGG GCCTCACCGG GCTCTGGCAA ATCAGCGGCC GGTCTGACCT CGATTGGCAG
ACCTCCGTGC GGCTTGACCT CTGGTACGTC GAGAACTGGT CCTTCTGGTT GGATCTCATG
ATTTTGTGGA AGACCGCCTT TGCTGTCGTC CGTGGGTCCG GTGCGTATTG A
 
Protein sequence
MVVLDEVSTL PRRTDRVLRT TRSRRAVRDR LSFLRRYGRV LAVAEGVIAA LAVLVYSSGY 
AGRVDRAAWL QVVALGLGWP ATLALLRAYE PRFLGLGSEE YRRVVHAGLG LTACIATAGY
ATSSLGGRGV ALFAVPGAMT ATLVTRYGAR KWLHFQRRQG RHLQRVLIVG HDATAAELAE
AMRREAYAGL FVVGACVPGG KAGSHHRLDA AGVPVIDDLE SVTRAVVAVD AAAVAVLPCP
ELCGPKLRKL GWDLEAAGVD LIVAPTIVDV TGPRIHIRPL AGLPLLHVEA PEFHGFRRVL
KEAFDRFAAA IALIVLSPLL LAVAIAVVAT SDGGAFFCQQ RVGKGGKSFR MYKFRSMYAD
AEHRLTELLD KNKHGATGVL FKLVDDPRVT PVGRFLRRYS LDELPQLVNV LLGHMSLVGP
RPPLAREVAM YGPEAKRRLL VKPGLTGLWQ ISGRSDLDWQ TSVRLDLWYV ENWSFWLDLM
ILWKTAFAVV RGSGAY
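
The protein sequence above corresponds to the gene sequence read under translation table 11. A minimal Python sketch of that translation, and of a GC content calculation, follows. It assumes the sequence is pasted as a string with the 10-base spacing shown on this page. It uses the standard codon assignments, which table 11 shares with the standard code, and it reads a table 11 initiator codon (such as the GTG that begins this gene) as methionine at the first position. The names `clean`, `gc_percent`, and `translate` are hypothetical helpers, not part of any library.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments; it differs in its initiator codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

# Initiator codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 20 bases of the gene sequence, as displayed on this page.
print(translate("GTGGTCGTTC TTGATGAGGT"))  # prints: MVVLDE
```

The six residues printed match the start of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be compared against the sequence itself.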