Gene Information |
Locus tag | Acel_0821 |
Symbol | |
ID | 4486309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 913514 |
End bp | 916267 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639729594 |
Product | SMC domain-containing protein |
Protein accession | YP_872580 |
Protein GI | 117928029 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | |
|
|
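The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Acel_0821
length = gene_length(913514, 916267)
aa = protein_length(length)
print(length, aa)  # 2754 917
```

Both results match the Gene Length (2754 bp) and Protein Length (917 aa) fields in the record.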
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0839342 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACTCG ACTCCGCTAC CGTCCGCCGC TATCGGCTGC ACCGGGACCT CACGGTCGAA TTCGACCCGT CGCGCACGCT GATTACCGGA GATAACGAGA CCGGCAAGAG CACGCTGGTC GAGGCGATTC ACCGCGCCCT CTTCCTCAAG GCCACTGTCA CCGGCAGCGT CCTCGCGGAG ATGCGGTCTC ACCTGTACGG CAGCTATCCG GAGGTGGAAC TCCGGTTCAC CGTCGGCGGT GACACGTACC AGCTGCGCAA ACGCTTCGCC GGCCAGCAAG GCACCGCAAC CTTGAGCCAG GTCGGCGGCC GAACCTGGCA CGGCGAAGAA GCCGAGGAAC GGCTTGCCGA GCTGCTCGGC GTCGAACCCA CCGGCGGCGG CCGGGGCATC GATAAACGGC TCAACGCCCA GTGGGCGCAT CTGTGGGTCT GGCAGGGACA AGGCGGCGAG GATCCCGCCC AATACACGGC CGCCCACAAC CAGCAACTCC TTCAGCAGCT GCAGCGCGTC GGCGGTGCGG TGGCCATGCA ATCTGCCCTG GACGGCCGGG TCGCTGAGCA TTTCGCGCAG CTGTACGACC GCCTGTTCAC CCAGACCGGA AAGGAAAGAG CCGGGTCAGA CCTGGCCGCC GCCAAAACCG CTTATGAGCA GGCCCGGGAG CGGTTCGAGC AGGCGGCGCT GCGCCTTGAG CGCGTCGAAA CCGCGATGAG TGACTTCGAG CACGCCGAGG AAACCATCCG CGAGGCCGCC GCCAGCCGGG AGGACGCCCA GCGGCAACTC CAGGCGGTCA ACGAAAAATT GGAGAAACTC ACCAGACTGC GCGACCAGGA GCAGGCGCAG GCCCGTGAAT TGGCCGAAAT CCAGGGCCGG TTGGCGGCCG TTGAAGAGAC CGACCGGAAA ATCCGGGAGT ACCAGGACCA GGCCCGAAAG CTGCAGGACG ACGCCGTCCC CTTGCACGCC AAGCTGGCGG ACGCCGACCA GCGCTTGCGC GATGTACGGG ACCAGGTAGG CCGCGCGCAG AAAATCCGTG ACGAGGCGGC GGAAAAAGCC CGGCAGGCCC GCAGAATCCG GGATTTGGCC GTTGCGTATG AACGGATGCT GGACGCCGAG CAGCAGCATC AGGCTCTCGC CCTGCGGGTG GCGGAGATCG AACGCCACGA GCAGACCATC CGGGAACGCG GCAGCCGGCT TGCCCAGCTC CCGGCCATTG ACCGAAACGA CCTGGAGGAG CTTCAGGAGC GCGATCGCGA GCGCGGCGAG GCCGACGCCC GTCTGACCGC AGCTGCGCCG GAAATTGAAC TGCTCGAATC CTCCCTGCCC GTGCGGCTCG GCGACCGGGC GCTGTCGCCC GGGGAGCCGG TGAGAGTTCT AGAAACCACC GAATTGACAG CCGGCGACCT GCGGGTGCGC ATTCACCCCG GCCGTGGGGA GAATCTCGCG TCCCTTCGGG CCCGGATCGC AGAACTCGGT CGCGAGATCC AGGAAAAGCT GGACCGATGG GGGCTGCCGG GGATCGCCGA AGCCGCTGCG GTTGTCGTGC AACGGCAAGA GATTCAGCAG GAGATCGACC GGCTGCAGGC GGCGCGTGAG GCGCTGAATC CGGAATCGAC CCGGGCGCAA TTCCAGGAGG CCGAGCAGCG CCTGATCGCC GCCCGGGCCG AACGGGACCG GCTGCAGCCA ACCGTGGGCG ACGGGTACGC CGTCCCGGGA AACCTTGAGG CCGCCCGGCA CCAGGTTGAC GAGGCACAAC AGCGCGTGGA TGAGGCCGAA GCCGGCGAGC AGCACGCCGC CGGCGTCCTG 
GAGACCCTGA CCGCGGAGTT GGCGCAACTC ACCGCGGATC GGGAGAGCGT CGAGCAATCG CTCCGGGAGC TCCAACGCCA ACAACACGAA GCGGACGCCG CAATTCGCGT CCTCGTCGAA CAAGCCGGCG ACGAGACCCA GCGCCGCAGC CACCTCCAGG AATTGCACGA GGCGGTTCAG ACCCTGGAAA AACGCCTCGA AGAAACCCGT GCTGCGATCG CCGAATTACA ACCCGACCTT CTCGACGCGG ATCGGGAACG CCTCGAGCGT GTCCTCCACA ACGCCCAGGA AAGCATCCGG CTGTCGGAAC AGCGGCGGGC CGCAGCCCAC GCGCTCCTGC AGACCGACGG CAGCACGGAT CCCCGCGCCG AATACGCCCA CGCCCGGGCG CAGCTGGCTG AGGCCGATGA GCGGCGGAAG GCCGCGGAGC GTTACGCAGC CGCCGTCCGG CTGGTGCACG AACTCTTCGC CGCTCAGCAA AAACAGCTTG CCGAGCATTT CTCGCGGCCG CTGGCTGAAA AAATCACCAC CTACCTTCAG CCCGTGTTCG GACCCCGCGT GCAGGCGGTG GCGAAATTCG ACGGCAACGA TTTCAAAGGC GTCGAGCTGG TTCGCCCGGC CCTGGACGCC GCGCTGCCGT TCGACGCGCT CAGCGGCGGG ACACGCGAAC AGGTGGCCGC CGCGGTACGC CTGGCGATCG CCGAACTGCT GGCGGCCGAC CATGACGGCA CGTTGCCCGT CGTCTTCGAC GATGCCTTCG CCAATTCCGA CCCGCACCGC ATCGCGCTGC TGCAGCGCAC CCTCGACCTC GGCGCCCGCC GCGGCCTCCA GATCATCGTT CTGACCTGCC ACGGCACCTC CTATTCCGCG CTGGGGGCCC GGCACATCGA GCTCGAATCA CCGAGAGCAC CTGCGGATTC CCGCCCGGTG TCGGTGACCG ATCCTGCGCA GTAG
|
Protein sequence | MRLDSATVRR YRLHRDLTVE FDPSRTLITG DNETGKSTLV EAIHRALFLK ATVTGSVLAE MRSHLYGSYP EVELRFTVGG DTYQLRKRFA GQQGTATLSQ VGGRTWHGEE AEERLAELLG VEPTGGGRGI DKRLNAQWAH LWVWQGQGGE DPAQYTAAHN QQLLQQLQRV GGAVAMQSAL DGRVAEHFAQ LYDRLFTQTG KERAGSDLAA AKTAYEQARE RFEQAALRLE RVETAMSDFE HAEETIREAA ASREDAQRQL QAVNEKLEKL TRLRDQEQAQ ARELAEIQGR LAAVEETDRK IREYQDQARK LQDDAVPLHA KLADADQRLR DVRDQVGRAQ KIRDEAAEKA RQARRIRDLA VAYERMLDAE QQHQALALRV AEIERHEQTI RERGSRLAQL PAIDRNDLEE LQERDRERGE ADARLTAAAP EIELLESSLP VRLGDRALSP GEPVRVLETT ELTAGDLRVR IHPGRGENLA SLRARIAELG REIQEKLDRW GLPGIAEAAA VVVQRQEIQQ EIDRLQAARE ALNPESTRAQ FQEAEQRLIA ARAERDRLQP TVGDGYAVPG NLEAARHQVD EAQQRVDEAE AGEQHAAGVL ETLTAELAQL TADRESVEQS LRELQRQQHE ADAAIRVLVE QAGDETQRRS HLQELHEAVQ TLEKRLEETR AAIAELQPDL LDADRERLER VLHNAQESIR LSEQRRAAAH ALLQTDGSTD PRAEYAHARA QLAEADERRK AAERYAAAVR LVHELFAAQQ KQLAEHFSRP LAEKITTYLQ PVFGPRVQAV AKFDGNDFKG VELVRPALDA ALPFDALSGG TREQVAAAVR LAIAELLAAD HDGTLPVVFD DAFANSDPHR IALLQRTLDL GARRGLQIIV LTCHGTSYSA LGARHIELES PRAPADSRPV SVTDPAQ
|
| |