Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0976 |
Symbol | |
ID | 4485392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 1075536 |
End bp | 1077383 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639729751 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_872735 |
Protein GI | 117928184 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1022] Long-chain acyl-CoA synthetases (AMP-forming) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00986526 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.875047 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGCGG TAACGGAGAT CCCCGCGACT GCGACACTCA CGGACGCCGT CCTCGACCAT GCGCGAACGC GTCCGGACGC CGTGCTCTTC CGCCGCCGTC CCGTGCGCCC GGCCCGCGAG AGCCGGCCGC CCGGCCCAGC CGCGCGCGCG ACCGAACCGC CGACCTGGAC GCCGATCACC GCTGCGCAGT TCCACGCTGC GGTCGACGCC CTGGCCCGGG GCCTGCTGGC CGCCGGACTG CAACCGGGCG CCCGGGTCGC CCTGCTCTCC CGAACCCGCT ACGAGTGGAC GCTGGTCGAC TACGCCCTCT GGCACGCAGG TCTGATCACC GTGCCGATCT ACGAGACATC CTCACCGGAC CAGATCGGTT GGATTCTCGG TGACGCGCAG GTCGCCGCCG CGATCGTCGA GAGTCCCGAA CATGCCCGCG TTGTCAACGC CGTCCGCGAT GTGGTGCCCA ATCTGCAGCA CATCTGGGTG ATCGAGGACG GCGATCTCGA TCGGCTCACA GCCCAGCGGG CCTCCGACGA CCAGCTGGCC TCCGCCCGGC GCGAGCTGGG CGCCGGATCC GTGGCGACCA TCGTCTACAC CTCCGGAACC ACCGGACGGC CCAAGGGATG CGTGCTCACG CACGGCAATC TGCTGTTCGC CGCGCGCAGC GCCGTCGGCA GCCTGCCGGC GCTCTTCCAT GAGCACACGT CGGCGCTGCT GTTTCTCCCG CTCGCTCACG TGTTCGCCCG CGAGATCCAG GTCGCCTGCG TCGAAGCTGC CGTGCCGGTC GGGCACTGCC CGGACACCTA CCAACTCCAG ACCGACCTCG CGTCCTTCCG CCCGACCCTC CTCCTCGCGG TGCCGTACCT GCTGGAGAAG GTGTACTGGC TGGCGGCCCG GACCGCCGAG CAACGCGGCA CGTCACGGCT TTTCGCCGCC GCCGTCCGGA ACGCCCAGCA GGTGAGCGCC GGGCCGGTCA CGGCTGCCGT TCGCGTCCGG CGGGCGCTGT TCGACCGGCT CGTCTACCGC CGCATTCGCG CCGCCCTCGG CGGGGCGGTC GAGTGGGTCG TCTCCGGCGG AGCGCCGCTG GCACCCCATC TCGGGCACTT CTTCCGCGGT GCGGGCATCC CGGTGTTGGA GGGCTGGGGA TTGACCGAGA CGACCGCGGC CGCGACCGTC AACCGACCCG ACGCCACCAA GATCGGAACC GTTGGACTGC CGCTTGCCGG GACGGAGGTC GGGCTGACGG CGGACGGCGA ACTGCTGGTT CGCGGCGGCC ACGTCTTTGC CGGCTACTGG GGGGATCCCG CCGCAACCCA GGAGGTGCTG GACGCCGACG GGTGGCTGCA CACCGGGGAT CTCGGTGAGA TCGATGACGA CGGCTTTGTC ACGATCACCG GTCGCCGGAA GGAGATTCTG GTGACCGCGG GCGGGAAGAA CGTCGCACCC GCGGTGCTGG AGAACCGGGT GGCCGGTCAC CCGCTGGTCG CCCACTGTGT CGTCGTCGGC GACGGCCGGC CGTACGTGGC CGCGCTCATC ACCCTTGACC CCGAAGCCGT TGACGCGTGG AAACAGAAGA TGGGCAAGCC GGCAAGCCTG ACCCTCGCCG AGCTGCGGGA CGATCCGGAC CTCGTCGCGG AGATCCAAGC CGCAGTGGAC GAAGCAAACC AGGCCGTCTC GAGAGCCGAG TCGATTCGCC GGTTCCGCAT TCTCGACACC GAGTTCCGGC AAGACACCGG TCAGCTCACC CCGACCCTCA AGGTGCGGCG TGACGTCATC GCCGCTCAGT TCGCCGCCGA GATCGACGAA CTCTACACAC GGTTGCCTCA GCCCGCTCCA CCTTCCAGCC GACATTGA
|
Protein sequence | MPAVTEIPAT ATLTDAVLDH ARTRPDAVLF RRRPVRPARE SRPPGPAARA TEPPTWTPIT AAQFHAAVDA LARGLLAAGL QPGARVALLS RTRYEWTLVD YALWHAGLIT VPIYETSSPD QIGWILGDAQ VAAAIVESPE HARVVNAVRD VVPNLQHIWV IEDGDLDRLT AQRASDDQLA SARRELGAGS VATIVYTSGT TGRPKGCVLT HGNLLFAARS AVGSLPALFH EHTSALLFLP LAHVFAREIQ VACVEAAVPV GHCPDTYQLQ TDLASFRPTL LLAVPYLLEK VYWLAARTAE QRGTSRLFAA AVRNAQQVSA GPVTAAVRVR RALFDRLVYR RIRAALGGAV EWVVSGGAPL APHLGHFFRG AGIPVLEGWG LTETTAAATV NRPDATKIGT VGLPLAGTEV GLTADGELLV RGGHVFAGYW GDPAATQEVL DADGWLHTGD LGEIDDDGFV TITGRRKEIL VTAGGKNVAP AVLENRVAGH PLVAHCVVVG DGRPYVAALI TLDPEAVDAW KQKMGKPASL TLAELRDDPD LVAEIQAAVD EANQAVSRAE SIRRFRILDT EFRQDTGQLT PTLKVRRDVI AAQFAAEIDE LYTRLPQPAP PSSRH
|
| |