Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0431 |
Symbol | |
ID | 4485666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 460788 |
End bp | 461966 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639729198 |
Product | hypothetical protein |
Protein accession | YP_872191 |
Protein GI | 117927640 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.287626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGACGTCA ACCGGGCAAC CGCGGCCGCC GCCCGGCTCG CCGCGCTGCG AGAGCACGTC GGAGTTCCGG GCATCGTGCT CACCACCCCG GGCAACGTTG CCTGGCTCAC CGGCGGTGCC AACACGCCCA TCGACCGCGG CGCAACCACC GACGTCATCT GGGCGGTCGC GACCCCGGAC CACATAGCGA TCATCACAAC CGAGGTTGAA GCGCCCCGAT TGGCCGATGA AGCGGCATTC GGTTCCCTCG GTTGGCCTAT CCACGCGGTG CCGTGGTGGG AGCCCGCCAT GTTCGTCGCC AAGGCCGAAG CGATTGCCGG CGTTCATGCG GACCGCCTCG CCGCCGACGG GCACCCTGCC TTCGGACTCG ACCTCGCCGT CCACCTCGTG CGCACCCGCA TGGCCCTCAC ACCGCCCGAC GTCGCGGCCC TTGCCCGGCT GGGCACAGAC GCCGCCGCAG CCGTCAGCGC CGCGCTTCGC GCCTGGCAAC CCGGGGACAC CGACCGTACG ATCGCCGCTC GGATCGCCGC GGATCTCGAA GAGGTCGGGG CATTTCCCAC CGTTCTCCTC GTCGGCGGCG ACGACCGCCT CCAGCGCTAC CGGCACCCGG TCACCGTTGG TGCTCCGCTC CACGACGTCG TCATGACCGT GCTCGTAGCG GTCCGCGGCG GCCTTCACGT CGCCCTCACC CGCTACGCCG CCCGCACCCC GCCGCCCACC GAGTGGCGGG ACCGGCTCAA CGCCGTCCGG CGGATCCACC GGGCTGTCCT CGACGCCTGC TGGCCCGGAC GCACGGTCGG TGCGGCCCTC CAAGCCCTTG CCGCGGCCTA CGCGGCCGAG GGCGCCCCCG ACGCCTGGCG CCAGCACTAC CAAGGCGGGC CGATCGCCTA CGCGCAGCGG GAGTACGAAC TCGCCCCGGT GCAGACCACG GACCAGTGGT GGAGTCACGT CCTCACCTCC GACACGGCGG TGGCGTGGAA TCCCAGCCTC CCCGGCGGTG CGAAGGACGA AGACACCTAC CTCATCACCC CCGCCGGACC CCGCCTCCTC ACCACCTGCA CGGAATGGCC CATGACCAAC GACGACGTGC CGCGACCCGA CGTCTGGGTC ATCGACGGCA CCCCGCCGAG CGGCGAGCGA CAGCAACAGC CGTTGCCCAT CGGCACGTCG CCGCCGTGA
|
Protein sequence | MDVNRATAAA ARLAALREHV GVPGIVLTTP GNVAWLTGGA NTPIDRGATT DVIWAVATPD HIAIITTEVE APRLADEAAF GSLGWPIHAV PWWEPAMFVA KAEAIAGVHA DRLAADGHPA FGLDLAVHLV RTRMALTPPD VAALARLGTD AAAAVSAALR AWQPGDTDRT IAARIAADLE EVGAFPTVLL VGGDDRLQRY RHPVTVGAPL HDVVMTVLVA VRGGLHVALT RYAARTPPPT EWRDRLNAVR RIHRAVLDAC WPGRTVGAAL QALAAAYAAE GAPDAWRQHY QGGPIAYAQR EYELAPVQTT DQWWSHVLTS DTAVAWNPSL PGGAKDEDTY LITPAGPRLL TTCTEWPMTN DDVPRPDVWV IDGTPPSGER QQQPLPIGTS PP
|
| |