Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0192 |
Symbol | |
ID | 4808608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 232808 |
End bp | 233878 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640105603 |
Product | integrase catalytic subunit |
Protein accession | YP_001036626 |
Protein GI | 125972716 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2826] Transposase and inactivated derivatives, IS30 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGTAC AATATAAGTC TACCACAACT GAGCATAAGT TTAAACACTT AAGTGTTTAT GAAAGAGGGC AGATTGCAGC TCTTTTAAAA GAAGGAAAGA GTCAACGTTA TATTGCTAAT AAACTAGGTC GCTCGCCAAG TACAATTAGC CGTGAAATTA AAAGAGGGAC AACAATGCAG ATGAGAACTG ATTTATCGAC ATACAAAGTA TATTTTCCTG AAACAGGGCA GGCAGTTTAT GAGAAAAATC GTATGAATTG CGGAGCAAAG CGTAAATTGG CTCAAGTTGA AGATTTTCTT AAGTTTGCAG AAGATAAGAT ACTACGCGAA AAATGGTCTC CAGATGCAGT TGTTGGTTTA TGTAGGAGAG ACCCCAAGTG GCAAAATTCT ACTATTGTAT GTACCAAAAC ACTGTATAAT TATATAGACC TGGGACTCAT AAAAGTACGA AATATAGATT TAAATCTTAA ACTACGTTTA AAATCTAAAA TAAAAAGGAT ACGTCAAAAC AAACGGGTTG TAGGGAAAAG CATTGATCAA AGGCCGGAAG AAGTACAATC ACGTCAAACC TTTGGGCATT GGGAAATTGA TACGGTAACA GGCAAAAAGT CTAACGATTC AGTAATTTTA ACCTTAACTG AACGAAAAAC CCGCTACGAG TTATTGTTTC TTTTGGACGC AAAAGACAGT AATACTGTTA ACGAGGCACT TTCAGAACTT AAGAATTGTT ATGGTAAGGA TGTTTCAAAT GTATTTCGCA CTATAACGGC AGACAATGGT TCTGAATTTA GTAGACTATC CGAAATGTTA CAAGGGCTAG GAATTGAAGC TTATTTCACT CATCCTTATT CCTCATGGGA GAGAGGAACT AATGAACGTC ATAATGGACT TATTAGGCGT TTTATTCCTA AAGGAAAGGC TATAAAAGAT TTTTCTGAAG AAACGATAAA ACGGATACAA CAATGGTTAA ACAGCCTTCC ACGAAGGATA TTAGGTTACA AAACACCTGA AGAATGTTTT AATGAAGAGA TACATAACCT GGTAAACAAA AATATATCAG CAATAGCCTG A
|
Protein sequence | MAVQYKSTTT EHKFKHLSVY ERGQIAALLK EGKSQRYIAN KLGRSPSTIS REIKRGTTMQ MRTDLSTYKV YFPETGQAVY EKNRMNCGAK RKLAQVEDFL KFAEDKILRE KWSPDAVVGL CRRDPKWQNS TIVCTKTLYN YIDLGLIKVR NIDLNLKLRL KSKIKRIRQN KRVVGKSIDQ RPEEVQSRQT FGHWEIDTVT GKKSNDSVIL TLTERKTRYE LLFLLDAKDS NTVNEALSEL KNCYGKDVSN VFRTITADNG SEFSRLSEML QGLGIEAYFT HPYSSWERGT NERHNGLIRR FIPKGKAIKD FSEETIKRIQ QWLNSLPRRI LGYKTPEECF NEEIHNLVNK NISAIA
|
| |