Gene Acel_0849 details

Gene Information

Locus tag: Acel_0849
Symbol:
ID: 4485732
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 942380
End bp: 943576
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 64%
IMG OID: 639729623
Product: hypothetical protein
Protein accession: YP_872608
Protein GI: 117928057
COG category: [N] Cell motility
COG ID: [COG1749] Flagellar hook protein FlgE
TIGRFAM ID: [TIGR03506] flagellar hook-basal body proteins
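
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 942380
end_bp = 943576
gene_length_bp = 1197
protein_length_aa = 398


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks should hold if the record is internally consistent.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions, 943576 - 942380 + 1 gives 1197 bp, and 1197 / 3 - 1 gives 398 residues, matching the listed lengths.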


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.709656
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTCCGTT CCCTGTTCTC CGGCGTCTCG GGCCTGCGGG CCCACCAGAC GATGCTGGAT 
GTCGTCGGCA ACAACATTGC CAATGTCAAC ACCGTCGGAT TCAAAGCCTC CCAAGTGGAA
TTTGAAGACA CCCTCAGCCA GATGCTCCGC GCCGCCGGCG CACCGCAGGG CGTTCAGGGC
GGCACCAACC CGGCCCAAGT AGGACTGGGT GTTCAGATTG CCGCCATCAG CACGACCTTC
ACCCAGGGTC CGGCCGAAAC CACCGGCGTC GACACGGATC TGATGATCCA AGGTGACGGT
TTTTTCGTGC TTGATGACGG TGGTCAGCGG GTGTACACCC GGAACGGCGC CTTCTCTTTC
GACGCGAACG GCAACCTCGT CAGCGCCAAC GGCGCTCTCG TGCAAGGCTG GCTGGCCAGC
GGCGGTGTGG TGAATACCAC TGGACCGGTC ACCGCGATAA AGCTCCCGCT CGGAACGTCG
ATGCCCCCAT CGGCGACGAC CACCGCGACC CTCGCCGGCA ACCTCCCGGC CGACGGTTCC
GGCTCGCCGA TCGACAACAA CCTCACGGTG TATGACGCCA AGGGGCAGGC CACCCAGTAC
ACCCTCGAGT ACAGCTACGA CACGACGTCC AACAGCTGGA AGCTGGACAT TCTGCCGCCG
GGCGGCGGTT CGCCGACCAA TGTGCCGTTG AGTTTCGATT CGACGACCGG TCAGTACAAC
GGCACCAACC CGCAAACCGT GACCCTGGGT GGGCAGAACA TCAGCCTCGA CCTCTCCGGG
CTGACCGCCT ACGGCGGCGG GAACACCGTC GAGGTGGTCT CGTCGGACGG TTCCGCCATG
GGCTCGCTCG CGTCGTACAC CATTTCGCCG GACGGCACCA TCCAGGGTGT CTTCACCAAT
GGGATGAAGC AACCGCTCGC CAAGATTGCC CTGGCGACCT TCAACAATCC CGGCGGTCTC
ACCAAGGTAG GCACGTCCGA GTACGCCGAA TCGGTGAACT CCGGCGCCGC GCAGATCGGT
GCGTCGGGGA CGGGAAGCCG CGGCCAGCTC GCCGCCGGTG AGCTCGAGGG CTCGAACGTC
GACCTGTCCC AGGAATTCAC CAACTTGATC ATTGCCGAGC GGGGTTTCCA GGCCAACGCG
AAGGTCATCA CCACATCCGA CCAAGTTCTC CAGGATCTGG TGAATCTCAA GCAGTAA
 
Protein sequence
MLRSLFSGVS GLRAHQTMLD VVGNNIANVN TVGFKASQVE FEDTLSQMLR AAGAPQGVQG 
GTNPAQVGLG VQIAAISTTF TQGPAETTGV DTDLMIQGDG FFVLDDGGQR VYTRNGAFSF
DANGNLVSAN GALVQGWLAS GGVVNTTGPV TAIKLPLGTS MPPSATTTAT LAGNLPADGS
GSPIDNNLTV YDAKGQATQY TLEYSYDTTS NSWKLDILPP GGGSPTNVPL SFDSTTGQYN
GTNPQTVTLG GQNISLDLSG LTAYGGGNTV EVVSSDGSAM GSLASYTISP DGTIQGVFTN
GMKQPLAKIA LATFNNPGGL TKVGTSEYAE SVNSGAAQIG ASGTGSRGQL AAGELEGSNV
DLSQEFTNLI IAERGFQANA KVITTSDQVL QDLVNLKQ
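
The protein sequence above appears to be the translation of the gene sequence under translation table 11, in which an initiating GTG codon is read as methionine. The sketch below illustrates this on the first 60 bases only. The codon table and the set of start codons are written out here as assumptions about table 11 (the bacterial, archaeal and plant plastid code), and `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

# Codon table in TCAG order; table 11 is assumed to share the standard
# amino acid assignments and to differ only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiator codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first 20 residues of the protein sequence.
fragment = "GTGCTCCGTT CCCTGTTCTC CGGCGTCTCG GGCCTGCGGG CCCACCAGAC GATGCTGGAT"
expected = "MLRSLFSGVS GLRAHQTMLD".replace(" ", "")
print(translate(fragment) == expected)
```

Only the leading fragment is checked here. Translating the full 1197 bp sequence the same way would be the natural next step for comparing against all 398 residues.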