Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0817 |
Symbol | |
ID | 4486305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 908345 |
End bp | 910378 |
Gene Length | 2034 bp |
Protein Length | 677 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639729590 |
Product | hypothetical protein |
Protein accession | YP_872576 |
Protein GI | 117928025 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.062184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.168506 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAGTT GGCGTGACCA AATCCTGAGT GAGTTCACAC CCCAGGTCGC CCGGCTTACC CTGGTCGCGG ATCCGGACGG CCTGCTCCTT GAAGAAGGTG TGCTCGAAGG GATCCGGGAA CGGGGCTTTG AGCTGATCCC GTTCGAGGAT CATGTCGCCT TTCGCTATGC CTACGAATCC AAGTTCCGCT CCCGCTGGGA CCGTCGTGAG GAAACCGATC TGGTGGTGGT GCTGAGATCG CCGTCGAGCG ATCTCGATGC CTTGCCCTAC GACCTGCTGC AGGCTGGCCG CAAGCTCTCT TTCAACCTGG GCGATCTCTT CCCGAATCTC AGCTACCCCG TGGTCGCGGT ACTCGACCGG GCTGATCTCG ACGTCTTGTT CGATGCGCAA TTGAGGCACG CGCCGGGGCC GCTCGGCGAC AACGCGACGA AGGAGTTCGT CCTTCGTCAT GTGTTCGAGA TCGCGCCGGA ACTCATCAAG CAGCCATCTG ACCTACTCCG GGTCTTGCTG CGTCGCCATT ACCGCGGACA ACGAATCCCC CACATCCTCG ATGAGCGTTT TATCCAGGTG CTTCGTCAAA ACGGCCTATT TGAGGATTGG CCTCTGGAAG CCATCATTCC GGACGAGCAA GCATTCTTTG CGTTCCTGCA GGAGCGTTGG CCGGTGTTCC TGGATTCGCT CGCGAATTCA AACGATGATG CCGTCTACGC GGGTGCGGCG GTCTACGGCT TCAAGTCCCC AGGACCTGCG CTCTTGCCTT TCGACCACGA CGATGTCCGC GTCTACCTCG ACAACCTGTT CTTCGAAGGA TTGCTCCAGC CAGTTCTTCA CGAGCAAGCG CAGACGCTTG CACAGTCTTG GGCAGTGTGT GGCGTGAGAA TCTCGCCAGA GGAAGATCGC CTACGCCGCG TGAGGGGTCT TCTCGATTTT ATCGAGAAGA CGCTTCCCAC AGTAGACTCG CGGTACGCGG AGTGGCTTGA GTTTGCTCCT CGTTGGTCCG AACTCGTTGC GCTGACGTTG GACTCAGACC TGCCTACGCG CGACGCACAG GCAAGCGCTA CGCTACCACC CGGATACGGG GAGCGTTTGG AGGCATTGAA AGCTCAGATC GATGTCACGT TCACCGAATG GATCGTGAAG CGCTACGCAA GTCTCATCAA CCTTCCGCCA CTGCCGCCGG TGATGCTGCA TCACATCCCT CCCTTCCTCG CTAGAAACCT TGACAATGCC CACGATGCAA AAGTCGCTCT TTTGCTCATG GACGGTCTTG CCTTGGACCA GTGGGTTGTC TTGCGTCGTG TACTTGAGGA GTCAGATGCG ACGTTGCGGT TCCGCGACAA CGTCGTATTC GCTTGGATTC CGACTATTAC TTCCGTTTCC CGACAAGCCA CGTTTGCCGG TAAGCCGCCC ATCTATTTCC CAACGAGTAT CCTTACCACG ACCAAAGAGT CGGAACTTTG GACTCAGTTC TGGGTTGGGG AAGGGTTAAC GAAACATGAG GTGGGCTACA TGAAGGGACT CGGAGACGGA AGCTTGGACG GGCTTGCCGA GCTCCTGAGC CGACCCAGGT TACGCGTTGT CGGATTGGTT ATCAACAAGG TCGATCGAAT CATGCACGGC ATGGAGCTAG GATCGGCGGG CATGCACAAC CAAGTTCGAC TATGGGCGAG GCAGGGATTC ATGTGTGATC TCCTTCGTTT GCTCCATGAC CGTGGATTTC AGGTTTATCT CACCTCAGAC CACGGCAACA TCGAGTCGGA GGGCATAGGT GAACCTTCCG AAGGATCGGT CGCGAAGGAA GCTGGCGAGC GTGTTCGGGT CTATTCGGAT CTCAAGCTCA GGGCCCAGGT CAAGGAGCGG TTCCGAGGGG CAGTAGAGTG GCCGCCTTTC GGCCTACCGG AGGACTACCT CGCCCTTATT GCTCCTAACC GAGCCGCCTT CGTACAGAGA GGGAAAACCA GGGTCGCTCA TGGCGGTATC AGTGTCGAAG AGCTTCTTGT CCCCTTTGTC GAAATTGAGA GACAAGATCG ATGA
|
Protein sequence | MGSWRDQILS EFTPQVARLT LVADPDGLLL EEGVLEGIRE RGFELIPFED HVAFRYAYES KFRSRWDRRE ETDLVVVLRS PSSDLDALPY DLLQAGRKLS FNLGDLFPNL SYPVVAVLDR ADLDVLFDAQ LRHAPGPLGD NATKEFVLRH VFEIAPELIK QPSDLLRVLL RRHYRGQRIP HILDERFIQV LRQNGLFEDW PLEAIIPDEQ AFFAFLQERW PVFLDSLANS NDDAVYAGAA VYGFKSPGPA LLPFDHDDVR VYLDNLFFEG LLQPVLHEQA QTLAQSWAVC GVRISPEEDR LRRVRGLLDF IEKTLPTVDS RYAEWLEFAP RWSELVALTL DSDLPTRDAQ ASATLPPGYG ERLEALKAQI DVTFTEWIVK RYASLINLPP LPPVMLHHIP PFLARNLDNA HDAKVALLLM DGLALDQWVV LRRVLEESDA TLRFRDNVVF AWIPTITSVS RQATFAGKPP IYFPTSILTT TKESELWTQF WVGEGLTKHE VGYMKGLGDG SLDGLAELLS RPRLRVVGLV INKVDRIMHG MELGSAGMHN QVRLWARQGF MCDLLRLLHD RGFQVYLTSD HGNIESEGIG EPSEGSVAKE AGERVRVYSD LKLRAQVKER FRGAVEWPPF GLPEDYLALI APNRAAFVQR GKTRVAHGGI SVEELLVPFV EIERQDR
|
| |