Gene Acel_2144 details


Gene Information

Locus tag: Acel_2144
Symbol:
ID: 4485612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 2426600
End bp: 2427724
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 70%
IMG OID: 639730946
Product: NLP/P60 protein
Protein accession: YP_873902
Protein GI: 117929351
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0791] Cell wall-associated hydrolases (invasion-associated proteins)
TIGRFAM ID:
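
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 2426600
END_BP = 2427724


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1125
print(protein_length(length_bp))  # 374
```

Both values agree with the Gene Length and Protein Length fields of the record.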


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTCC TTTACGGCCG GCGCGGAAGT GCTGTGCGCC GTCGGCGTCG CATCGTCGGT 
ATGGCGTTCG CGCTGCTCAG CCTGGTCGGT GCGACCGTGA TTCTCCCGCC CCGGGCGGCG
TCTGGTGATC CCCTCGACGA CGCCCGCGCC AAAGCAGCCG CCCTCCGTGA CACCGTCGCG
CGGCTGCAGC TGCAGGCGGA GCAGGCATCG GAACGGTACG ACGCCGCCGA GGCACAGCTC
GCGGATCTCG TCGTACGGGA ACGCGCCGCG CAACTGCGGG CTGACGCCGC GCGCTCAGCC
CTCGATGCGG CGCAGGCGAC TCTTGTCGGC CGGATCCGTG CGCTGTACAT GGCCGGCGGG
ACACTCGGGA TGTACGCGAC CGTGTTGAGC GGCGGGAACC CCGCGCAGAT CCTCACCGGC
CTGCACGACG TCGCGGTGCT CTCCACCGGT GACCGACATG CATTGCAGAT CGTCCAGCGC
AGCCGCGCTG AGCTGGACGC CGCCGCCGCC GCAGTGGCGG CACTGGTTCA GCAGCACGCG
GACCTGCTCG CCGCCGCAGC CGCCGCCGAG GCGCAGGTGC AGCAAGCACT GGCTGAGCAG
CAGGCGGCGT TGGACGCCGC CACTGCTCAA GTCCGTGCGC TCGAGGCTCA GCTCGAGGCC
CAGCTTGAGG CCCAGCGCGC GGCCGAGGCT GCCGCCGCGC TCGCGGCCGC CCGGCAAGCC
GCCTTTCAGG CCGGTTATCG CCCACCGCAG CCGAGCCGCA TCGCGCTTGC CGCGATCGCG
GCTGCGGAGA CGCAGCTCGG CAAGCCGTAC GAGTACGGCG GTTCCGGACC CGACAGCTGG
GATTGCTCAG GGCTCACGCA ATTTGCGTAC CGGCAGGCCG GCGTTTTCCT GCCCCGCACC
GCGGCGGAAC AATTTCTCGC CGTCGCCGAG AAGGTTCCGC TCGGTGAGCT CATCCCTGGG
GATTTGCTCT TCTGGGCCAC CGATCCGACG AATCCGGCCA CGATTCATCA CGTCGCGATT
TATCTGGGCG ACGGCCGAAT GCTTGCCGCG CCGCACACGG GAACCGTCGT CCAAATCCAA
GATGTCTACC TCGACGGATA TTTCGGCGCG GTGCGGCCGG GTTGA
 
Protein sequence
MTVLYGRRGS AVRRRRRIVG MAFALLSLVG ATVILPPRAA SGDPLDDARA KAAALRDTVA 
RLQLQAEQAS ERYDAAEAQL ADLVVRERAA QLRADAARSA LDAAQATLVG RIRALYMAGG
TLGMYATVLS GGNPAQILTG LHDVAVLSTG DRHALQIVQR SRAELDAAAA AVAALVQQHA
DLLAAAAAAE AQVQQALAEQ QAALDAATAQ VRALEAQLEA QLEAQRAAEA AAALAAARQA
AFQAGYRPPQ PSRIALAAIA AAETQLGKPY EYGGSGPDSW DCSGLTQFAY RQAGVFLPRT
AAEQFLAVAE KVPLGELIPG DLLFWATDPT NPATIHHVAI YLGDGRMLAA PHTGTVVQIQ
DVYLDGYFGA VRPG
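
The protein sequence corresponds to the gene sequence translated codon by codon. The following is a minimal sketch of that translation and of a GC fraction calculation. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ only in which start codons are permitted), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first and last stretches of the gene sequence are used here, to keep the example short.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from position 0 until the first stop."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 21 and last 15 bases of the gene sequence listed above.
print(translate("ATGACGGTCC TTTACGGCCG G"))  # MTVLYGR
print(translate("GTGCGGCCGG GTTGA"))         # VRPG
```

The two outputs match the start and end of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.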