Gene Acel_1842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1842 
Symbol 
ID4485449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2083315 
End bp2084184 
Gene Length870 bp 
Protein Length289 aa 
Translation table11 
GC content65% 
IMG OID639730632 
Producthypothetical protein 
Protein accessionYP_873600 
Protein GI117929049 
COG category[R] General function prediction only 
COG ID[COG1611] Predicted Rossmann fold nucleotide-binding protein 
TIGRFAM ID[TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1
[TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0573823 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.171511 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAACG ACCGGGCGAA GGTGCCGCCG CGGGATGAGT CATCCACCGA TGCGCGGCAG 
AACGCCGCTG CGCGCCGGCA GGCGGACCCG GCATCCACCG GCAACCACGG CAACCACACC
AACCACCACG TCGTCGAGAA GCGGCGCGGG CCGGTCCTGC TCCGCCGTTC CCAGGTCTCC
ACCACGACGA CGGATCAGCG GTTGCTCGAC AGCCGTGGCC CGTCCGACTG GGTGCACACC
GATCCGTGGC GCGTGCTGCG GATCACCTCG GAATTCGTGG AAGGGTTCGG GCTGCTTGCC
GAGCTCGGCG CCGCCGTCTC GGTCTTTGGT TCCGCGCGGA CCACGCCGGA TCACCCGGAT
TACGCGGCCG CGGAGAAACT CGGCGCCGCG CTGGCCCGCG CCGGGTACGC CGTCATCACC
GGCGGTGGTC CGGGCGTCAT GGAGGCGGTG AACAAGGGAT GCAGCGAGGC CGGCGGAGTC
TCCGTTGGAC TGGGCATCGA GCTCCCCTTC GAGCAACGCC TCAACGATTG GGTGGACATC
GGCATTCAAT TTCGGTACTT CTTCGCCCGC AAGACCATGT TCGTGAAGTA CGCCCAAGGC
TTTGTCGTTT TCCCCGGCGG TTTCGGCACG CTGGATGAGC TCTTCGAAGC GCTGACCTTG
GTGCAGACAC GCAAGGTCAC CTCGTTTCCC GTCGTCTTGT ACCGCGAAGA GTACTGGCAT
GACCTCATCG AATGGACCCG CCGGCGCATG CTGGACGAAG GAAAGATTTC ACCGGAAGAT
CTCGATTTGT TCTCCGTAAC CGATGACGTC GATGAGATCG TGGAGATTAT GGAGCGTGCG
GAAGCGGCGC GCTACGGAGC GGCATCCTGA
 
Protein sequence
MSNDRAKVPP RDESSTDARQ NAAARRQADP ASTGNHGNHT NHHVVEKRRG PVLLRRSQVS 
TTTTDQRLLD SRGPSDWVHT DPWRVLRITS EFVEGFGLLA ELGAAVSVFG SARTTPDHPD
YAAAEKLGAA LARAGYAVIT GGGPGVMEAV NKGCSEAGGV SVGLGIELPF EQRLNDWVDI
GIQFRYFFAR KTMFVKYAQG FVVFPGGFGT LDELFEALTL VQTRKVTSFP VVLYREEYWH
DLIEWTRRRM LDEGKISPED LDLFSVTDDV DEIVEIMERA EAARYGAAS