Gene Acel_0444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0444 
Symbol 
ID4485194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp476740 
End bp477915 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content62% 
IMG OID639729211 
ProductUDP-sulfoquinovose synthase 
Protein accessionYP_872204 
Protein GI117927653 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.52178 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.455696 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGTCC TGATTCTCGG CGGTGACGGT TTCTGCGGTT GGCCGACCTC CCTGCACCTG 
TCGGCACAAG GGCATGACGT GCACATTGTC GACAACTTCG CCCGGCGGTG CGCGGACATC
GAATTGGAAG CCGAATCGCT CACCCCGATC GCGCCGATGG GAACCCGGTT GCGCGCCTGG
CGTGAGGTGA GCGGCAAGGA GATCGAATTC TCCCGGTTCG ACGTCGCGGT GCACTATCAC
CGGCTGCTCA CCCTGCTGCA GGAGTGGCAG CCGGACGCGG TTGTGCACTT CGCCGAACAG
CGGGCCGCGC CGTACTCGAT GAAATCGTCG TGGCACAAGC GGTACACGGT GAACAACAAC
ATCAACGCGA CGAACAACCT GCTCGCCGCC ATCGTCGAAT CCGGGCTGGA CATTCACGTC
GTCCACCTCG GAACGATGGG CGTGTACGGC TACGGCACCG CCGGGATCAA AATTCCCGAA
GGATACCTGC GGGTGCAGAT TCCCAAGGAG AACGGCGAAG TCGTTGAATC GGAAATCCTC
TACCCGCCGA ACCCGGGGTC GATTTATCAC ATGACGAAGA CGCAGGACCA GCTGCTCTTC
GCCTACTACA ACAAGAACGA CGGGGTGCGG GTCACCGACC TGCACCAGGG CATCGTCTGG
GGCACCCAGA CTGTCGAGAC CCGGCTCGAC GACCGGCTCA TCAACCGATT CGATTACGAC
GGCGATTACG GAACTGTGCT GAACCGGTTC CTCGTCGAAG CCGCGATCGG ATATCCGCTG
ACCGTGCACG GATCGGGCGG CCAGACCCGC GCGTTCATCA ACATTCAAGA CACCGTGCGG
TGCATTCAGC TTGCGGTCGA GAATCCGCCC AACCCCGGGG AGCGGGTGCG GGTCTTCAAC
CAGATGACCG AGTGTCACCG GATCATCGAC TTGGCCAAGC TGGTCTCCGA GCTCACCGGC
GTGGAGATCG ATCACGTGGA GAATCCGCGG AACGAAGCGG ACTCCAACGA CCTGTTCGCC
GAGAACCGGC AGCTCCTCGA ACTCGGGTTG AAGCCGATCA CCCTGGAGGC CGGGCTGCTC
ACCGAAATCA CCGAGATCGC GCGGAAGTAC GCCGACCGGA TCGACGTCGA CAAGATCCCG
TGCCGGTCGT ACTGGCGTCC GAAGCGGAGT GTGTGA
 
Protein sequence
MRVLILGGDG FCGWPTSLHL SAQGHDVHIV DNFARRCADI ELEAESLTPI APMGTRLRAW 
REVSGKEIEF SRFDVAVHYH RLLTLLQEWQ PDAVVHFAEQ RAAPYSMKSS WHKRYTVNNN
INATNNLLAA IVESGLDIHV VHLGTMGVYG YGTAGIKIPE GYLRVQIPKE NGEVVESEIL
YPPNPGSIYH MTKTQDQLLF AYYNKNDGVR VTDLHQGIVW GTQTVETRLD DRLINRFDYD
GDYGTVLNRF LVEAAIGYPL TVHGSGGQTR AFINIQDTVR CIQLAVENPP NPGERVRVFN
QMTECHRIID LAKLVSELTG VEIDHVENPR NEADSNDLFA ENRQLLELGL KPITLEAGLL
TEITEIARKY ADRIDVDKIP CRSYWRPKRS V