Gene Information |
Locus tag | Clim_1571 |
Symbol | |
ID | 6354219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | - |
Start bp | 1688873 |
End bp | 1690414 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642669175 |
Product | transposase IS4 family protein |
Protein accession | YP_001943597 |
Protein GI | 189347068 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3666] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
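The start and end coordinates, gene length, and protein length listed above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon, if present, does not code for an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Clim_1571.
length_bp = gene_length(1688873, 1690414)       # 1542
protein_aa = expected_protein_length(length_bp)  # 513
print(length_bp, protein_aa)
```

Because the gene is not spliced, no intron lengths need to be subtracted before dividing by three.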
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0160489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATCG ATTTTATCAG AGCTGATCGT GATACCCCGT TTTTGTTTCC TCCATCAGTG CAGGAATGGC TCCCGGCAGA TCATTTGGCC CGGTTTGTCG TTGATATTGT CAGCCAACTC AACCTGTCGT CACTCAAGGA CGTCTATGCC GGTAGAGGAT CAAGGCCATA TGATCCGGCC ATGCTGCTCT CCTTGTTGTT TTATGGCTAT GCGACTGGCG TGTTTTCCAG CCGGAAGCTT GAACAGGCAA CCTATGATTC CGTAGCGTTC CGTTATATCA CCGGTAATCA GCACCCTGAT CACGATACTA TCGCCATGTT CCGGAAGCGG TTTTCAGGAG AACTCAAGGA TTTGTTTCAC CAGATTCTCA TCATTGCTCA TGAGATGGGG ATCCTGAAAA TCGGGACGGT AAGTCTCGAT GGCACGAAAG TCAACGCCAA TGCATCGAAG CATCAGGCGC TGAGCTGGGA TCATGCCGAT AAAATTGCCA GGCAATTGCA AGAAGAGATT GACCAGTTAT GGTTATTGGC AGAACAAGCT GATCAGTCCG TGATTCCCGA TGGCATGAAA ATACCCGAGG AGCTTGAGCG GCGGGAAACC AGACTTGCCA GCATAATTGA AGCCAAACGG AAAATAGAAG CGCGAGCCGA TGAACGCTAT GCAGAAGAAA AGCAAGTATA CGAGAACAAG TTGGCAGAAC GGGAAAAGAG AGAAAAAGAA CGGGGCAAAA AATCCGGAGG CAAGCCTCCC AAACCGCCGG AACCGGGCCC GAAACCACAC GATCAAGTCA ATCTGACCGA TGAAGAATCA CGGATCATGC CTGCAAGTGG CGGCGGTTTT CTGCAGGCGT TCAATGCCCA GGCATGCGTT GATATCGCAA CCTTGCTTAT TGTGGCAGCA CACACGACAC AACAGCCCAA CGATAAAAAA CAGATCGAGC CGGCTCTCGA AGCATTGGGA AACTTGCCCG AAGAACTTGG TCAAGTCAAC GAACTGCTTG CCGATACCGG ATATTACAGT CAGGCAAATG TCGAGGCCTG TGAAGCCGCC GGAATCAATC CACTTATAGC GATATCGCGA GAATCGCATA ATCAGAAACT TGAAGATCGA TTCAGTGAAC CAGAACCAGT ATCCGAGACG GCTGACGGTA TCACCAAAAT GAGGTACCGG TTGAAGAGCA AAGAAGGTAA GGCACTGTAT GCCAAACGGA AGTGCACTGT GGAACCGGTA TTCGGTATCA TCAAATCGGC CCTTGGGTAT CGCCAGTTTT TACGCAGAGG CTTTGAGAAT GTCAATGCGG AATGGACACT GGTAAGCATG GCATGGAATC TGAAACGCAT GCACGTTTTG ACCAAACCAC GCATTAAAAA CCCGGTTGTT GCAGCCTGCA AAGCGAAATC GGATGCTTAT AGCGACCGAT TATCCGAAAT CTGGCGTGTT AAGCTTTTCG AAATGCATCA TGTGGCAAAA ATAGCCACTA TACAGGCTAT TGAAACCATT TCTTCGATGA AATGTCAATT TGTCAGTCCG ACAGGCTGCT AG
|
Protein sequence | MSIDFIRADR DTPFLFPPSV QEWLPADHLA RFVVDIVSQL NLSSLKDVYA GRGSRPYDPA MLLSLLFYGY ATGVFSSRKL EQATYDSVAF RYITGNQHPD HDTIAMFRKR FSGELKDLFH QILIIAHEMG ILKIGTVSLD GTKVNANASK HQALSWDHAD KIARQLQEEI DQLWLLAEQA DQSVIPDGMK IPEELERRET RLASIIEAKR KIEARADERY AEEKQVYENK LAEREKREKE RGKKSGGKPP KPPEPGPKPH DQVNLTDEES RIMPASGGGF LQAFNAQACV DIATLLIVAA HTTQQPNDKK QIEPALEALG NLPEELGQVN ELLADTGYYS QANVEACEAA GINPLIAISR ESHNQKLEDR FSEPEPVSET ADGITKMRYR LKSKEGKALY AKRKCTVEPV FGIIKSALGY RQFLRRGFEN VNAEWTLVSM AWNLKRMHVL TKPRIKNPVV AACKAKSDAY SDRLSEIWRV KLFEMHHVAK IATIQAIETI SSMKCQFVSP TGC
|
| |
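The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate codons. It assumes the gene sequence is given 5' to 3' in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand) and uses the amino-acid assignments of the standard genetic code, which translation table 11 shares. All function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11 uses
# the same assignments and differs only in permitted start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(sequence.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; stop codons are written as '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence from the record.
prefix = clean("ATGAGTATCG ATTTTATCAG AGCTGATCGT")
print(translate(prefix))  # MSIDFIRADR
```

The translated prefix matches the first block of the protein sequence in the record. Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the listed GC content, although that figure was not recomputed here.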