Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0834 |
Symbol | |
ID | 4485880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 928911 |
End bp | 930110 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639729608 |
Product | flagellin domain-containing protein |
Protein accession | YP_872593 |
Protein GI | 117928042 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.9933 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCTGC GCATCAACAC CAACGTCGAC GCACTCGACG CCTACCGGAA CCTCTCGATG ACGCAGAGCA AGCTGTCGGA CAGCTTGCAG AAGCTCTCCA GTGGTTACCG GATCAACAAG GCTGCGGACG ACGCAGCCGG TCTGTCCATC AGCCAGAAAT TGCAGGCACA GATCGGCGGT CTCCAGCAAG CCGTGAAGAA CGCTCAGGAT GGAATCAACG TCGTGCAGAC CGCCGACGGC GCCCTCAATG AGGTGCAGTC GATTCTGCAG CGCATGAACG TCCTTGCCGT GCAGGCGGCG AACACCGGTT CACAAGACCA AGCCGCCCGC CAAGCCGCGC AAACGGAAAT CCAGCAGCTG AACGCTGACC TCGACGCGAT CGGCAACAAC ACGCAGTTCG GTCAGAGCAA GCTCCTCGAC GGATCGTTCG GCAGCTCGGC GGCGGCGAAG AGCTTTGCGG TCAGCGGTGG CTCGGTCACG GCGGCGACCG CCAGCTTCAA GATTTCCGGC ACCTTCAACG GCGTCGCCTT GGCAAACGCC ACGGTCAACG TCGCCAACGG TACCTACAGC ACCGCCAGTT CGTTGCAGAC CGCATTGCAG AACGGGATCG ACGCGACTCT CACCGCCAAC GGCATCACCG CCGGTGCAGT GCAGGCCAGC GTGACCGACG AGGGCAACGG TGTCTGGAAG GTGACCCTGA GCTCGTCCGC GGTGGGCGCC GGTAACACGT TCTCCACGTC TGCGACGACG GGCCTCACCG ACGGTAACGG CAATGCCGTC AGCCTCAACG GCACGACCTC GGCGCAGGGC TCCGGCGGTG GTGTGTTCCA GATCGGCGCC CAGGCCGGCC AAACCCAGAC TGTCCAGATC GGGGCGGTGA GCGCCAACGC GCTTGGGACC GACACCATCG ACCTGGTCAA CAACGCAAGC GCTGCTATCG CGACGATCGC GAGCGCTATT ACGACCGTGT CGACCGAGCG GTCCAGCCTC GGTGCATACC AGAACGGGTT CCAGCACATC ATCAACAACC TGAATGTGAC AGTGGAGAAC CTGCAGGCGT CCAACTCCAC CATCCAGGAC ACCGATATGG CCCAAGAGAT GGTGCACTTC ACGCAGGCGC AGGTTCTTCA GCAGGCCGGT GTGTCGATGC TGGCGCAGGC CAACGTCGAA ACGCAGGCTG TCCTGAAGCT GCTGCAGTAG
|
Protein sequence | MGLRINTNVD ALDAYRNLSM TQSKLSDSLQ KLSSGYRINK AADDAAGLSI SQKLQAQIGG LQQAVKNAQD GINVVQTADG ALNEVQSILQ RMNVLAVQAA NTGSQDQAAR QAAQTEIQQL NADLDAIGNN TQFGQSKLLD GSFGSSAAAK SFAVSGGSVT AATASFKISG TFNGVALANA TVNVANGTYS TASSLQTALQ NGIDATLTAN GITAGAVQAS VTDEGNGVWK VTLSSSAVGA GNTFSTSATT GLTDGNGNAV SLNGTTSAQG SGGGVFQIGA QAGQTQTVQI GAVSANALGT DTIDLVNNAS AAIATIASAI TTVSTERSSL GAYQNGFQHI INNLNVTVEN LQASNSTIQD TDMAQEMVHF TQAQVLQQAG VSMLAQANVE TQAVLKLLQ
|
| |