Gene Acel_1951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1951 
Symbol 
ID4484921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2211367 
End bp2214783 
Gene Length3417 bp 
Protein Length1138 aa 
Translation table11 
GC content61% 
IMG OID639730743 
ProductHNH endonuclease 
Protein accessionYP_873709 
Protein GI117929158 
COG category[S] Function unknown 
COG ID[COG3513] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR02646] conserved hypothetical protein TIGR02646 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGGTT CTGAGGTGGG GACGGTGCCT GTCACCTGGC GGCTGGGCGT CGACGTAGGC 
GAGCGGTCAA TCGGCCTCGC GGCGGTTTCC TATGAGGAAG ACAAGCCGAA AGAGATCTTG
GCCGCAGTGT CGTGGATACA TGACGGCGGC GTCGGGGATG AAAGATCGGG AGCGAGTCGG
CTGGCTCTGA GAGGCATGGC GCGCCGTGCT CGCCGGCTGC GACGCTTTCG TCGCGCACGC
CTACGTGATC TCGACATGCT TCTTTCGGAG CTCGGCTGGA CGCCGTTGCC GGACAAGAAC
GTGTCGCCAG TCGACGCTTG GCTTGCTCGT AAGCGGCTCG CCGAGGAATA CGTCGTCGAT
GAGACCGAGC GGCGCCGCCT GCTTGGATAC GCGGTGAGCC ACATGGCACG TCACCGGGGA
TGGCGGAACC CGTGGACGAC CATTAAAGAC CTCAAAAATC TACCGCAGCC TAGCGACTCG
TGGGAGCGCA CTCGGGAGAG TCTGGAAGCC CGGTACTCCG TGTCGCTCGA GCCGGGGACG
GTTGGTCAGT GGGCCGGCTA CCTGCTTCAG CGCGCACCGG GGATTCGGCT GAATCCAACG
CAACAGTCCG CCGGACGCCG CGCCGAGTTA AGTAACGCAA CTGCCTTTGA AACGCGTCTT
CGCCAGGAGG ATGTGCTCTG GGAATTGCGG TGCATTGCCG ATGTGCAGGG GCTGCCGGAG
GATGTCGTGT CAAACGTCAT CGACGCGGTG TTCTGTCAGA AGCGGCCAAG TGTCCCGGCA
GAGCGCATCG GCCGCGATCC CCTTGACCCC AGTCAGCTTC GCGCGTCACG CGCGTGTCTG
GAGTTTCAGG AGTATCGCAT CGTTGCGGCG GTGGCTAACC TGCGGATACG TGATGGCTCT
GGTTCACGGC CGCTTTCCCT TGAAGAGCGC AACGCGGTGA TCGAAGCGTT GCTTGCTCAA
ACCGAGCGTA GCCTCACGTG GTCAGATATT GCGCTGGAGA TTCTTAAGTT GCCGAATGAA
AGCGACCTGA CGAGCGTGCC GGAAGAAGAC GGGCCGTCGT CGCTCGCATA TTCACAGTTC
GCTCCGTTCG ACGAGACGTC GGCCCGTATT GCTGAGTTTA TCGCTAAAAA CCGGCGTAAG
ATTCCAACCT TCGCTCAATG GTGGCAGGAG CAGGATCGCA CGTCGCGTTC AGATCTTGTG
GCCGCCTTGG CCGACAACTC TATTGCCGGC GAGGAAGAGC AAGAGCTCCT TGTGCACCTA
CCGGACGCGG AGCTCGAGGC GCTTGAGGGC CTGGCTCTAC CAAGCGGCCG CGTCGCGTAT
AGTCGGCTGA CCTTGTCTGG TTTGACACGG GTTATGCGGG ATGACGGTGT TGACGTGCAT
AATGCTCGAA AAACGTGTTT CGGGGTTGAC GACAACTGGC GCCCGCCGCT ACCGGCGTTG
CACGAAGCGA CTGGACATCC GGTGGTGGAC CGCAATCTCG CGATTCTGAG GAAGTTTCTT
TCGTCAGCCA CGATGCGCTG GGGTCCACCC CAGAGTATCG TCGTTGAGTT GGCACGGGGA
GCCAGCGAGT CAAGGGAACG ACAGGCGGAA GAAGAAGCGG CCCGTCGTGC CCATCGGAAA
GCAAACGATC GAATCCGCGC TGAACTTCGC GCCTCCGGCC TTAGTGATCC GTCTCCAGCT
GATTTGGTCC GTGCGCGGCT GCTCGAACTC TACGACTGCC ATTGCATGTA CTGCGGCGCG
CCCATTAGCT GGGAGAATTC GGAGCTTGAC CACATTGTCC CGCGTACGGA CGGCGGCTCG
AACAGACATG AGAATCTCGC GATTACCTGC GGAGCGTGCA ACAAGGAAAA AGGTCGACGT
CCCTTTGCGT CGTGGGCTGA GACAAGTAAT AGAGTACAGC TTCGCGACGT GATCGATCGT
GTCCAAAAGC TGAAATACTC AGGCAACATG TACTGGACTC GCGACGAGTT CTCTCGGTAC
AAGAAGTCTG TCGTCGCCCG CCTCAAAAGG CGCACCTCCG ATCCCGAAGT GATTCAAAGC
ATTGAATCCA CCGGCTACGC AGCTGTCGCG CTAAGGGACC GCCTGCTCAG TTACGGCGAA
AAGAACGGAG TAGCTCAGGT AGCGGTTTTC CGCGGTGGCG TGACCGCCGA AGCGAGACGC
TGGCTCGATA TCTCGATCGA GAGGCTCTTC TCCCGTGTTG CAATTTTTGC TCAGTCAACG
AGCACGAAGC GGCTCGATCG TCGGCACCAT GCCGTGGACG CGGTTGTGCT CACAACGCTG
ACTCCGGGCG TCGCGAAGAC CTTGGCCGAC GCGCGGAGTC GTCGCGTCTC AGCGGAGTTC
TGGCGTCGGC CTAGCGATGT GAACCGGCAT TCAACAGAGG AGCCTCAGTC GCCAGCTTAT
CGTCAATGGA AGGAGTCCTG CTCGGGTCTA GGCGACTTAC TCATTAGTAC CGCCGCCCGT
GACAGTATCG CCGTGGCCGC ACCGTTGCGA CTGAGGCCGA CCGGCGCGCT GCATGAGGAA
ACACTTCGCG CCTTCTCGGA GCACACGGTG GGAGCTGCGT GGAAAGGAGC AGAGCTACGA
CGCATCGTCG AGCCTGAGGT TTATGCCGCC TTCCTCGCCC TTACCGATCC TGGTGGTCGA
TTCCTCAAGG TGAGCCCTAG TGAAGACGTG CTACCCGCTG ACGAGAACCG GCATATCGTC
CTTTCTGATA GGGTCTTGGG TCCTCGCGAT CGGGTAAAGC TGTTCCCGGA TGATCGAGGC
TCGATACGGG TGCGCGGCGG TGCGGCGTAC ATTGCATCTT TTCACCACGC CCGCGTCTTC
CGCTGGGGCA GCAGTCATTC CCCGTCCTTT GCTCTGCTGC GTGTTTCACT TGCGGACTTG
GCTGTTGCTG GCCTTCTTCG TGACGGCGTG GACGTCTTTA CTGCTGAGTT GCCGCCGTGG
ACGCCGGCAT GGCGCTATGC CAGCATCGCG TTAGTGAAGG CGGTGGAGTC TGGCGATGCG
AAACAGGTTG GCTGGCTTGT CCCCGGAGAC GAGTTGGATT TCGGCCCGGA AGGTGTGACA
ACGGCAGCCG GGGATTTAAG TATGTTCCTG AAGTACTTCC CGGAGCGGCA TTGGGTCGTC
ACAGGCTTTG AGGATGATAA AAGGATCAAT CTTAAGCCGG CGTTTCTCTC CGCGGAACAA
GCTGAGGTTC TCCGCACGGA GAGAAGCGAT CGTCCCGACA CCTTGACTGA AGCCGGGGAG
ATTCTCGCAC AATTCTTCCC GCGGTGTTGG CGGGCGACCG TCGCAAAGGT CTTGTGCCAC
CCTGGCCTTA CGGTGATCCG ACGAACTGCG CTTGGTCAAC CTCGGTGGCG CCGGGGTCAT
CTCCCGTACT CGTGGCGGCC TTGGAGCGCA GATCCCTGGA GCGGCGGTAC ACCATGA
 
Protein sequence
MGGSEVGTVP VTWRLGVDVG ERSIGLAAVS YEEDKPKEIL AAVSWIHDGG VGDERSGASR 
LALRGMARRA RRLRRFRRAR LRDLDMLLSE LGWTPLPDKN VSPVDAWLAR KRLAEEYVVD
ETERRRLLGY AVSHMARHRG WRNPWTTIKD LKNLPQPSDS WERTRESLEA RYSVSLEPGT
VGQWAGYLLQ RAPGIRLNPT QQSAGRRAEL SNATAFETRL RQEDVLWELR CIADVQGLPE
DVVSNVIDAV FCQKRPSVPA ERIGRDPLDP SQLRASRACL EFQEYRIVAA VANLRIRDGS
GSRPLSLEER NAVIEALLAQ TERSLTWSDI ALEILKLPNE SDLTSVPEED GPSSLAYSQF
APFDETSARI AEFIAKNRRK IPTFAQWWQE QDRTSRSDLV AALADNSIAG EEEQELLVHL
PDAELEALEG LALPSGRVAY SRLTLSGLTR VMRDDGVDVH NARKTCFGVD DNWRPPLPAL
HEATGHPVVD RNLAILRKFL SSATMRWGPP QSIVVELARG ASESRERQAE EEAARRAHRK
ANDRIRAELR ASGLSDPSPA DLVRARLLEL YDCHCMYCGA PISWENSELD HIVPRTDGGS
NRHENLAITC GACNKEKGRR PFASWAETSN RVQLRDVIDR VQKLKYSGNM YWTRDEFSRY
KKSVVARLKR RTSDPEVIQS IESTGYAAVA LRDRLLSYGE KNGVAQVAVF RGGVTAEARR
WLDISIERLF SRVAIFAQST STKRLDRRHH AVDAVVLTTL TPGVAKTLAD ARSRRVSAEF
WRRPSDVNRH STEEPQSPAY RQWKESCSGL GDLLISTAAR DSIAVAAPLR LRPTGALHEE
TLRAFSEHTV GAAWKGAELR RIVEPEVYAA FLALTDPGGR FLKVSPSEDV LPADENRHIV
LSDRVLGPRD RVKLFPDDRG SIRVRGGAAY IASFHHARVF RWGSSHSPSF ALLRVSLADL
AVAGLLRDGV DVFTAELPPW TPAWRYASIA LVKAVESGDA KQVGWLVPGD ELDFGPEGVT
TAAGDLSMFL KYFPERHWVV TGFEDDKRIN LKPAFLSAEQ AEVLRTERSD RPDTLTEAGE
ILAQFFPRCW RATVAKVLCH PGLTVIRRTA LGQPRWRRGH LPYSWRPWSA DPWSGGTP