Gene Acel_0834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0834 
Symbol 
ID4485880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp928911 
End bp930110 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content63% 
IMG OID639729608 
Productflagellin domain-containing protein 
Protein accessionYP_872593 
Protein GI117928042 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.9933 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCTGC GCATCAACAC CAACGTCGAC GCACTCGACG CCTACCGGAA CCTCTCGATG 
ACGCAGAGCA AGCTGTCGGA CAGCTTGCAG AAGCTCTCCA GTGGTTACCG GATCAACAAG
GCTGCGGACG ACGCAGCCGG TCTGTCCATC AGCCAGAAAT TGCAGGCACA GATCGGCGGT
CTCCAGCAAG CCGTGAAGAA CGCTCAGGAT GGAATCAACG TCGTGCAGAC CGCCGACGGC
GCCCTCAATG AGGTGCAGTC GATTCTGCAG CGCATGAACG TCCTTGCCGT GCAGGCGGCG
AACACCGGTT CACAAGACCA AGCCGCCCGC CAAGCCGCGC AAACGGAAAT CCAGCAGCTG
AACGCTGACC TCGACGCGAT CGGCAACAAC ACGCAGTTCG GTCAGAGCAA GCTCCTCGAC
GGATCGTTCG GCAGCTCGGC GGCGGCGAAG AGCTTTGCGG TCAGCGGTGG CTCGGTCACG
GCGGCGACCG CCAGCTTCAA GATTTCCGGC ACCTTCAACG GCGTCGCCTT GGCAAACGCC
ACGGTCAACG TCGCCAACGG TACCTACAGC ACCGCCAGTT CGTTGCAGAC CGCATTGCAG
AACGGGATCG ACGCGACTCT CACCGCCAAC GGCATCACCG CCGGTGCAGT GCAGGCCAGC
GTGACCGACG AGGGCAACGG TGTCTGGAAG GTGACCCTGA GCTCGTCCGC GGTGGGCGCC
GGTAACACGT TCTCCACGTC TGCGACGACG GGCCTCACCG ACGGTAACGG CAATGCCGTC
AGCCTCAACG GCACGACCTC GGCGCAGGGC TCCGGCGGTG GTGTGTTCCA GATCGGCGCC
CAGGCCGGCC AAACCCAGAC TGTCCAGATC GGGGCGGTGA GCGCCAACGC GCTTGGGACC
GACACCATCG ACCTGGTCAA CAACGCAAGC GCTGCTATCG CGACGATCGC GAGCGCTATT
ACGACCGTGT CGACCGAGCG GTCCAGCCTC GGTGCATACC AGAACGGGTT CCAGCACATC
ATCAACAACC TGAATGTGAC AGTGGAGAAC CTGCAGGCGT CCAACTCCAC CATCCAGGAC
ACCGATATGG CCCAAGAGAT GGTGCACTTC ACGCAGGCGC AGGTTCTTCA GCAGGCCGGT
GTGTCGATGC TGGCGCAGGC CAACGTCGAA ACGCAGGCTG TCCTGAAGCT GCTGCAGTAG
 
Protein sequence
MGLRINTNVD ALDAYRNLSM TQSKLSDSLQ KLSSGYRINK AADDAAGLSI SQKLQAQIGG 
LQQAVKNAQD GINVVQTADG ALNEVQSILQ RMNVLAVQAA NTGSQDQAAR QAAQTEIQQL
NADLDAIGNN TQFGQSKLLD GSFGSSAAAK SFAVSGGSVT AATASFKISG TFNGVALANA
TVNVANGTYS TASSLQTALQ NGIDATLTAN GITAGAVQAS VTDEGNGVWK VTLSSSAVGA
GNTFSTSATT GLTDGNGNAV SLNGTTSAQG SGGGVFQIGA QAGQTQTVQI GAVSANALGT
DTIDLVNNAS AAIATIASAI TTVSTERSSL GAYQNGFQHI INNLNVTVEN LQASNSTIQD
TDMAQEMVHF TQAQVLQQAG VSMLAQANVE TQAVLKLLQ