Gene Acel_0413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0413 
Symbol 
ID4485971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp430731 
End bp431798 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content63% 
IMG OID639729180 
Productglucose-1-phosphate thymidyltransferase 
Protein accessionYP_872173 
Protein GI117927622 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1209] dTDP-glucose pyrophosphorylase 
TIGRFAM ID[TIGR01208] glucose-1-phosphate thymidylylransferase, long form 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0183855 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCAC TCGTGCTCTC CGGGGGAGCG GGCACCCGGC TCCGTCCGAT CACCCACACC 
TCCGCCAAGC AACTCGTCCC GGTGGCGAAC AAGCCGGTGT TGTTCTACGG CCTGGAGGCG
ATCCGGGACG CCGGGATCAC CGACGTCGGC ATCATCGTCG GCGACACGCG CGCCGAGATC
GAGGCCGCGG TGGGCGACGG CTCAGCGCTG GGCATCAAGG CGACGTACAT CCATCAGGAG
GCGCCGCTCG GCCTCGCCCA CTGTGTGCTG ATTGCGCGGG ATTTTCTCGG CGATGACGAC
TTCGTCATGT ACCTCGGGGA CAACTTCATC ATCGGCGGCA TTACCGATCT CGTTCAGGAA
TTCGTCCGGT GCGGCGCGGA TGCGCAAATT CTGCTCACCA AGGTGGACAA CCCTCAACAA
TTCGGGATTG CCGAGCTTGA CGAGGAGGGG CGCGTCGTCC GCCTGGTGGA GAAACCGGCC
CAGCCTCGCA GCGACCTCGC TTTGGTCGGC GTCTACATGT TCAAACCGGC GATTCACCAG
GCGGTGCGTG CGATCCGCCC GAGCGCCCGG GGTGAATTAG AAATCACTGA TGCCATCCAG
TGGCTGGTCG ACAACGGGTA CAACGTTCGG TCCCACTTTG TCAACGGTTA CTGGAAAGAC
ACCGGGCGGC TCGAGGACAT GCTGGAGTGC AATCGGAAAG TGCTCGAGAC GATCGAGCCG
GCCTGCCGCG GGCGGGTCGA CGCCGAGAGC CGCATCATCG GACGGGTGGT GATTGAGGAC
GGCGCTGTCA TCGAGCGGTC CACGGTTCGC GGCCCGGCGA TTATCGGCAA GGGCACGAAA
ATCATCGAGA GTTACGTCGG ACCCTTCACG TCGATTTACC ACGACTGCGT CATCGAGCGC
ACCGAGATCG AACACTCCAT CATCTTGGAA GCCACGAAGA TCATCGGAGT GAGCCGCATC
GAGGACTCGT TGATCGGCAA AGAAGTGGAG GTGGCGCCGT CCACCGCGCT GCCTCGGGCG
CACCGGCTCA TGCTCGGCGA TCACAGTAAG GTCTCGATCG CGTACTGA
 
Protein sequence
MKALVLSGGA GTRLRPITHT SAKQLVPVAN KPVLFYGLEA IRDAGITDVG IIVGDTRAEI 
EAAVGDGSAL GIKATYIHQE APLGLAHCVL IARDFLGDDD FVMYLGDNFI IGGITDLVQE
FVRCGADAQI LLTKVDNPQQ FGIAELDEEG RVVRLVEKPA QPRSDLALVG VYMFKPAIHQ
AVRAIRPSAR GELEITDAIQ WLVDNGYNVR SHFVNGYWKD TGRLEDMLEC NRKVLETIEP
ACRGRVDAES RIIGRVVIED GAVIERSTVR GPAIIGKGTK IIESYVGPFT SIYHDCVIER
TEIEHSIILE ATKIIGVSRI EDSLIGKEVE VAPSTALPRA HRLMLGDHSK VSIAY