Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2634 |
Symbol | |
ID | 4568750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 3021369 |
End bp | 3022898 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639767195 |
Product | transposase, IS4 family protein |
Protein accession | YP_913042 |
Protein GI | 119358398 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3666] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGCTG ATTTTATCCT CCCTGACAGG GATACACCGT ATCTGTTTCC GCCATCGGTA CAGGACTGGC TGCCGAAAGA GCATCTTGCC CGGTTTGTCG TAGATATTGT CAGTCAACTC GATCTCTCGT CGTTGAGAAA TTCCTATGCT GGCAGAGGCT CAAAACCATA TGATCCAGCC ATGCTGCTCA CCCTTATTGT CTATGGCTAT GCAACCGGCG TGAGTTCCAG CAGAAAGCTG GAACAGGCGA CCTATGATTC GGTAGCGTTC CGTTACATTA CCGGCAATCA GCATCCCGAT CATGACACCA TTTCGTCGTT CCGCCAGAGA TTTTCAACAG AAGTGAAAGC CCATTTCATC CGGGTTCTTG TCATTGCCAA TGAAATGGGT CTTATGCAGC TTGGTGCCGT CAGCCTTGAC GGCACCAAAA TCAAAGCCAA TGCATCGAAG CATCAGGCGA TGAGCTGGCA GTACGCCTGC AACCTTGAAA AGAAACTGCA GGCGGAAGTC GAACAGTTAT GGCTGCTTGC CGATCAGGCC GATGCATCGT CGATACCCGA CGGGATGAGT ATCCCTGAGG AGCTTACCAT CCGGGAAAAG CGCCTTGAGA CCATCGTCGA GGTGAAAAAG AAAATCGACC AGCGAGCCAG AGAAAGATAT GAGGAAGACA AACAGCTCTA TGAACAAAAG GTTGCGGACA GGGAAAAGAA AGAGAAAACC GGCAAGAAGC CTGGCGGCAA GCCACCGAAA GCTCCCGAGC CGGGCCCCCG ACCGAAAGAT CAGGTCAACC TGACCGACGA AGAATCGAGA ATCATGCCGG CCAGTGGCGG CGGCTTCATG CAGGCCTACA ACGCGCAGGC CTGTGTAGAC ATCGCAACCC TGCTCATTGT CGCCTGCCAC ATGACCCGGA AGCCCAATGA CAAGCAGCAG ATCGAGCCAG CCATTGCAGA ACTGGCAAAG CTGCCTGGAG AGCTGGGCGA GGTAAAAGAA GTGATCACCG ATGCCGGCTA CTTCAGCGCA GCCAACGTTG ATGCCTGTGA AGCTGCAGGA ATCAACCCGC TCATAGCTCT GTCTCGTGAA GCACACAACC AGAGTTTGGA ACAACGGTTC AGGCAACCGG AACCCATTGC AGCAGATGCC GATCCGGTAA CGAAGATGAA GCACAGACTA CAGACCACAG AAGGAAAGGA AACCTACGCA AAACGAAAAT GCACGGTCGA ACCAGTCTTC GGTATCATCA AGTCGGTGCT TGGTCATCGG CAGTTTTTGC GACGAGGGCT CAAGAATGTC CAATCAGAAT GGAACCTGAT CAGTATGGCC TGGAACCTGA AAAGGATGCA CGCATTGGCG AAGCCACGTC CGAAAAAGGG TGAATCAGTG GCCTGCAAAG CACAAAAGGG CAGTCAAATT GGTCTGTTAT GGCGTGTTTT TGACACCAAA CTGGTTTTCA GCCTAAAAAT GAACGTTCAG GCGTTTTTCA GGATAATTTT TTCAACAGGT CTGTCTATCG CAAGGCCGAC AGGCTGCTAG
|
Protein sequence | MRADFILPDR DTPYLFPPSV QDWLPKEHLA RFVVDIVSQL DLSSLRNSYA GRGSKPYDPA MLLTLIVYGY ATGVSSSRKL EQATYDSVAF RYITGNQHPD HDTISSFRQR FSTEVKAHFI RVLVIANEMG LMQLGAVSLD GTKIKANASK HQAMSWQYAC NLEKKLQAEV EQLWLLADQA DASSIPDGMS IPEELTIREK RLETIVEVKK KIDQRARERY EEDKQLYEQK VADREKKEKT GKKPGGKPPK APEPGPRPKD QVNLTDEESR IMPASGGGFM QAYNAQACVD IATLLIVACH MTRKPNDKQQ IEPAIAELAK LPGELGEVKE VITDAGYFSA ANVDACEAAG INPLIALSRE AHNQSLEQRF RQPEPIAADA DPVTKMKHRL QTTEGKETYA KRKCTVEPVF GIIKSVLGHR QFLRRGLKNV QSEWNLISMA WNLKRMHALA KPRPKKGESV ACKAQKGSQI GLLWRVFDTK LVFSLKMNVQ AFFRIIFSTG LSIARPTGC
|
| |