Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0872 |
Symbol | |
ID | 4570466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 993743 |
End bp | 994963 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639765468 |
Product | transposase, IS204/IS1001/IS1096/IS1165 family protein |
Protein accession | YP_911345 |
Protein GI | 119356701 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACC TTACCCTTTT TCAGATGGCC TTGGGACTTG AGTCCCCATG GTATGTATCG TCCTCATCAT TTGATGTCGA CCAAAAGCGC TTGGACATAC GAATAGATTT TAAACCGGGC AGCACCTTCT GTTGTCCTCA ATGCGGCCGA GAAGGCGTGA AGGCCTATGA TACCTCCGAG GCAACATGGC GCCATCTCAA CTTCTTTCAG CATGAGGCCT ACCTGACAGT TCGAGTGCCT CGTATATCCT GCCCTGAGTG CGGCATTCTC AAGCTGCAAT CATTTCCCTG GTCTCGCCGT GAGAGCGGCT TTACTCTGCT GTTTGAGGCG ATGATCATGA TCATGGCGAA GTCAATGCCG GTCAAGGCAA TAGCCGCCAT TGTCGGCGAG CATGACACCC GTATCTGGCG GATCATCAAC CACTATGTCG AAAAAGCCCG AGAGCAGGAG GATCACTCGG CAGTCACCAT GGTAGGTGTT GATGAAACCT CCAGCAAGCG CGGTCATAAC TATGTGTCGC TGTTCGTTGA CCTTGCCGTA TCGAAAGTGT TGTTTGCCAC TGAAGGGAAA GATGCAGCAA CGGTCAAGCG ATTCAGTGAA GATCTTGCCG CCCATAAGGG TGATCCGGCA TTGATCACCG AATTCTGCAG CGACATGTCA CCGGCATTCA TCAAAGGGGT CGCCGATAAC TTTACCAATG CCCAACTGAC CTTTGACAAG TTCCATATCA TGCAGGTCAT TAATAATGCT GTCGATGAAG TGCGGCGTCA GGAGCAAAAA GAGCGCCCTG AATTGCAGAG AAGCCGGTAC ATCTGGCTGA AAAACCAGAA CAACCTGAAG GCTTCACAAC GCAAACGCCT TGATGAGTTA TCCTTGCCCC GACTGAATCT GAAAACAACT CGAGCATACC GCATGCGACT AACTTTTCAG GAGTTTTTCG AGCAACCTCA GGTATTGGTG GAAGCATTTC TGAAGAAGTG GTATTTCTGG GCAACCCACA GCCAGCTGCA GCCAATGAAA GAGGCGGCTT ACACCATCAA ACGACACTGG TCTGGCATTC TGCGATGGTT CACTTCTCGT ATCAATAATG GGGTACTTGA GGGAATCAAT AGCCTCATCC AGGCAGCCAA AGCACGGGCA CGGGGTTACC GTACTACCAA AAATCTCATC AATATGATCT ACCTGATCAG CGGGAAGCTT AATTTTGGCT TACCCACTTG A
|
Protein sequence | MNDLTLFQMA LGLESPWYVS SSSFDVDQKR LDIRIDFKPG STFCCPQCGR EGVKAYDTSE ATWRHLNFFQ HEAYLTVRVP RISCPECGIL KLQSFPWSRR ESGFTLLFEA MIMIMAKSMP VKAIAAIVGE HDTRIWRIIN HYVEKAREQE DHSAVTMVGV DETSSKRGHN YVSLFVDLAV SKVLFATEGK DAATVKRFSE DLAAHKGDPA LITEFCSDMS PAFIKGVADN FTNAQLTFDK FHIMQVINNA VDEVRRQEQK ERPELQRSRY IWLKNQNNLK ASQRKRLDEL SLPRLNLKTT RAYRMRLTFQ EFFEQPQVLV EAFLKKWYFW ATHSQLQPMK EAAYTIKRHW SGILRWFTSR INNGVLEGIN SLIQAAKARA RGYRTTKNLI NMIYLISGKL NFGLPT
|
| |