Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_2144 |
Symbol | |
ID | 4485612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 2426600 |
End bp | 2427724 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639730946 |
Product | NLP/P60 protein |
Protein accession | YP_873902 |
Protein GI | 117929351 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTCC TTTACGGCCG GCGCGGAAGT GCTGTGCGCC GTCGGCGTCG CATCGTCGGT ATGGCGTTCG CGCTGCTCAG CCTGGTCGGT GCGACCGTGA TTCTCCCGCC CCGGGCGGCG TCTGGTGATC CCCTCGACGA CGCCCGCGCC AAAGCAGCCG CCCTCCGTGA CACCGTCGCG CGGCTGCAGC TGCAGGCGGA GCAGGCATCG GAACGGTACG ACGCCGCCGA GGCACAGCTC GCGGATCTCG TCGTACGGGA ACGCGCCGCG CAACTGCGGG CTGACGCCGC GCGCTCAGCC CTCGATGCGG CGCAGGCGAC TCTTGTCGGC CGGATCCGTG CGCTGTACAT GGCCGGCGGG ACACTCGGGA TGTACGCGAC CGTGTTGAGC GGCGGGAACC CCGCGCAGAT CCTCACCGGC CTGCACGACG TCGCGGTGCT CTCCACCGGT GACCGACATG CATTGCAGAT CGTCCAGCGC AGCCGCGCTG AGCTGGACGC CGCCGCCGCC GCAGTGGCGG CACTGGTTCA GCAGCACGCG GACCTGCTCG CCGCCGCAGC CGCCGCCGAG GCGCAGGTGC AGCAAGCACT GGCTGAGCAG CAGGCGGCGT TGGACGCCGC CACTGCTCAA GTCCGTGCGC TCGAGGCTCA GCTCGAGGCC CAGCTTGAGG CCCAGCGCGC GGCCGAGGCT GCCGCCGCGC TCGCGGCCGC CCGGCAAGCC GCCTTTCAGG CCGGTTATCG CCCACCGCAG CCGAGCCGCA TCGCGCTTGC CGCGATCGCG GCTGCGGAGA CGCAGCTCGG CAAGCCGTAC GAGTACGGCG GTTCCGGACC CGACAGCTGG GATTGCTCAG GGCTCACGCA ATTTGCGTAC CGGCAGGCCG GCGTTTTCCT GCCCCGCACC GCGGCGGAAC AATTTCTCGC CGTCGCCGAG AAGGTTCCGC TCGGTGAGCT CATCCCTGGG GATTTGCTCT TCTGGGCCAC CGATCCGACG AATCCGGCCA CGATTCATCA CGTCGCGATT TATCTGGGCG ACGGCCGAAT GCTTGCCGCG CCGCACACGG GAACCGTCGT CCAAATCCAA GATGTCTACC TCGACGGATA TTTCGGCGCG GTGCGGCCGG GTTGA
|
Protein sequence | MTVLYGRRGS AVRRRRRIVG MAFALLSLVG ATVILPPRAA SGDPLDDARA KAAALRDTVA RLQLQAEQAS ERYDAAEAQL ADLVVRERAA QLRADAARSA LDAAQATLVG RIRALYMAGG TLGMYATVLS GGNPAQILTG LHDVAVLSTG DRHALQIVQR SRAELDAAAA AVAALVQQHA DLLAAAAAAE AQVQQALAEQ QAALDAATAQ VRALEAQLEA QLEAQRAAEA AAALAAARQA AFQAGYRPPQ PSRIALAAIA AAETQLGKPY EYGGSGPDSW DCSGLTQFAY RQAGVFLPRT AAEQFLAVAE KVPLGELIPG DLLFWATDPT NPATIHHVAI YLGDGRMLAA PHTGTVVQIQ DVYLDGYFGA VRPG
|
| |