Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_1042 |
Symbol | |
ID | 4484519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 1148279 |
End bp | 1149634 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639729817 |
Product | hypothetical protein |
Protein accession | YP_872801 |
Protein GI | 117928250 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.289988 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0665058 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAGA TCCTTCTCAA AGGCGGCACC GTCATTACCA TGGACGAACA GATCGGCGAC CTACCCACTG GTGACGTCCT TATCGAGGAT GATCGAATCG CCGCCGTTCA ACCGAGCATT CACGCGGATG CTGAGATTGT TGACTGCACA GGACGCATCG TCATCCCTGG ACTGATCGAT ACGCACCGTC ACACGTGGGA AGCGGCGATC CGCAACTGTG CGCCGAACGC AACCCTCGAC GATTACTTCG TGGAGATTCT CGACACCTTC GCTCCTCTCT ATCGAGCTGA CGACGTGTAC GCCAGCAACC TTGCCGGTGC GCTCGAATGC CTCAATGCCG GCATCACGAC TCTCGTCGAC TGGTCACACA TCAACAACAC GCCCGAACAT CCGGATGCGG CTATCCGCGG ACTGCAAGAG GCGGGCATCC GCGCACAGTA CGCCTACGGC AGCGCGAACA CCTCACTGCA GAAATATTGG TTCTTCAGCG CCGAGGCGAT TCCGGCTGAT GACGTACGGC GCATCCGCAG CACATACTTC TCCTCGGATC AAGGGCTGCT CACCATGGCG CTTGCCACCC GCGGACCAGG CTTCACCCAA GATGACGTTG TCCGCGCCGA GTGGGGGCTC GCCCGTGAGC TTGGCATTCC AATAACCGTG CATGTCGGCA TGGGCCGGCT GGCCGGACGG TACGGCATGG TCGAGCAGCT CGATCGGCTT GGCTTACTGG GGCCGGATAT CACCTACATT CACTGCTGTT ACTTCAGCGA GCACGAATGG CGGCGGGTTG CCGACACCGG CGGCACCATA TCCATCGCGC CGCAGGTGGA AATGCAGATG GGCCACGGCT GGCCGCCGGT ACAGAAGGCA CACCGTTACG GCCTGCGGCC GAGTCTCTCC ATCGATGTCG TCACCACCGT TCCCGGCGAC ATGTTCACCG AAATGCGTGC CGCATTCGCC GGCGAGCGGG CCCGGATCAA CGCCGTCTAC TGGGAACTCG ACCAGCCGAT CCCCGAGGAC ACTCCGACAG CACGCCGAAT GCTCGAAATG GCCACCCGCA ACGGAGCACA CGTGGTGGGA CTCGAGGATC ACATTGGTTC CCTCACACCG GGTAAGAAAG CGGACGTCGT GATACTGGAC GCCCGCGCCC TGAACATGGC CCCGGTGCAC GACCCAGTCG CCGCCGTCGT CATCTCGGCC GACGTGTCCA ACGTCGAGCA CGTCATCGTC AATGGCGGGT TCCGTAAGCG TGACGGAAAG CTCCTCACCG ACGTGAACCG GGTGCGGACT CTCGTCGAGA ATTCCCGGGA TTACCTCGTG GCAGCGGCGG CGCAGAAGAA GGAGCAGGTG GGGTGA
|
Protein sequence | MGKILLKGGT VITMDEQIGD LPTGDVLIED DRIAAVQPSI HADAEIVDCT GRIVIPGLID THRHTWEAAI RNCAPNATLD DYFVEILDTF APLYRADDVY ASNLAGALEC LNAGITTLVD WSHINNTPEH PDAAIRGLQE AGIRAQYAYG SANTSLQKYW FFSAEAIPAD DVRRIRSTYF SSDQGLLTMA LATRGPGFTQ DDVVRAEWGL ARELGIPITV HVGMGRLAGR YGMVEQLDRL GLLGPDITYI HCCYFSEHEW RRVADTGGTI SIAPQVEMQM GHGWPPVQKA HRYGLRPSLS IDVVTTVPGD MFTEMRAAFA GERARINAVY WELDQPIPED TPTARRMLEM ATRNGAHVVG LEDHIGSLTP GKKADVVILD ARALNMAPVH DPVAAVVISA DVSNVEHVIV NGGFRKRDGK LLTDVNRVRT LVENSRDYLV AAAAQKKEQV G
|
| |