Gene Acel_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1042 
Symbol 
ID4484519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1148279 
End bp1149634 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content62% 
IMG OID639729817 
Producthypothetical protein 
Protein accessionYP_872801 
Protein GI117928250 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.289988 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0665058 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAAGA TCCTTCTCAA AGGCGGCACC GTCATTACCA TGGACGAACA GATCGGCGAC 
CTACCCACTG GTGACGTCCT TATCGAGGAT GATCGAATCG CCGCCGTTCA ACCGAGCATT
CACGCGGATG CTGAGATTGT TGACTGCACA GGACGCATCG TCATCCCTGG ACTGATCGAT
ACGCACCGTC ACACGTGGGA AGCGGCGATC CGCAACTGTG CGCCGAACGC AACCCTCGAC
GATTACTTCG TGGAGATTCT CGACACCTTC GCTCCTCTCT ATCGAGCTGA CGACGTGTAC
GCCAGCAACC TTGCCGGTGC GCTCGAATGC CTCAATGCCG GCATCACGAC TCTCGTCGAC
TGGTCACACA TCAACAACAC GCCCGAACAT CCGGATGCGG CTATCCGCGG ACTGCAAGAG
GCGGGCATCC GCGCACAGTA CGCCTACGGC AGCGCGAACA CCTCACTGCA GAAATATTGG
TTCTTCAGCG CCGAGGCGAT TCCGGCTGAT GACGTACGGC GCATCCGCAG CACATACTTC
TCCTCGGATC AAGGGCTGCT CACCATGGCG CTTGCCACCC GCGGACCAGG CTTCACCCAA
GATGACGTTG TCCGCGCCGA GTGGGGGCTC GCCCGTGAGC TTGGCATTCC AATAACCGTG
CATGTCGGCA TGGGCCGGCT GGCCGGACGG TACGGCATGG TCGAGCAGCT CGATCGGCTT
GGCTTACTGG GGCCGGATAT CACCTACATT CACTGCTGTT ACTTCAGCGA GCACGAATGG
CGGCGGGTTG CCGACACCGG CGGCACCATA TCCATCGCGC CGCAGGTGGA AATGCAGATG
GGCCACGGCT GGCCGCCGGT ACAGAAGGCA CACCGTTACG GCCTGCGGCC GAGTCTCTCC
ATCGATGTCG TCACCACCGT TCCCGGCGAC ATGTTCACCG AAATGCGTGC CGCATTCGCC
GGCGAGCGGG CCCGGATCAA CGCCGTCTAC TGGGAACTCG ACCAGCCGAT CCCCGAGGAC
ACTCCGACAG CACGCCGAAT GCTCGAAATG GCCACCCGCA ACGGAGCACA CGTGGTGGGA
CTCGAGGATC ACATTGGTTC CCTCACACCG GGTAAGAAAG CGGACGTCGT GATACTGGAC
GCCCGCGCCC TGAACATGGC CCCGGTGCAC GACCCAGTCG CCGCCGTCGT CATCTCGGCC
GACGTGTCCA ACGTCGAGCA CGTCATCGTC AATGGCGGGT TCCGTAAGCG TGACGGAAAG
CTCCTCACCG ACGTGAACCG GGTGCGGACT CTCGTCGAGA ATTCCCGGGA TTACCTCGTG
GCAGCGGCGG CGCAGAAGAA GGAGCAGGTG GGGTGA
 
Protein sequence
MGKILLKGGT VITMDEQIGD LPTGDVLIED DRIAAVQPSI HADAEIVDCT GRIVIPGLID 
THRHTWEAAI RNCAPNATLD DYFVEILDTF APLYRADDVY ASNLAGALEC LNAGITTLVD
WSHINNTPEH PDAAIRGLQE AGIRAQYAYG SANTSLQKYW FFSAEAIPAD DVRRIRSTYF
SSDQGLLTMA LATRGPGFTQ DDVVRAEWGL ARELGIPITV HVGMGRLAGR YGMVEQLDRL
GLLGPDITYI HCCYFSEHEW RRVADTGGTI SIAPQVEMQM GHGWPPVQKA HRYGLRPSLS
IDVVTTVPGD MFTEMRAAFA GERARINAVY WELDQPIPED TPTARRMLEM ATRNGAHVVG
LEDHIGSLTP GKKADVVILD ARALNMAPVH DPVAAVVISA DVSNVEHVIV NGGFRKRDGK
LLTDVNRVRT LVENSRDYLV AAAAQKKEQV G