Gene Information |
Locus tag | Cpha266_1429 |
Symbol | |
ID | 4568990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1623620 |
End bp | 1625194 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639766015 |
Product | integrase catalytic subunit |
Protein accession | YP_911881 |
Protein GI | 119357237 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.398273 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCAAACA AGGCCTTGAC TATGTTACAA GTTCGACGTA TTCTCAAACT CTTGATGGAG GAGTGTTCCC AACGGGAAAT CCATCGCAGT ACAGGTATTC ACCGCGTCAC CATCAAAAGC TATCTGCACC GGTTTACGAG CAGCGGAAAA CCGTTTTCAG AGCTGTATGC GCTCTCTGAT TACGATCTTT CTGTTCTGGT TCACCCACCC CGTTCCACCA AAACCTCTGA TGAACGGTAT GCAGATCTCC AGCCCCAACT GCAACGTTTT TCTGATGAGC TGAACAAGAC GAACTCTCAT GTTACCAAGC AGGTGTTATG GGAAGAGTAT CTTCAGGATC GACCTACCGG GTATCAATAT TCCCAGTTTT GCTATCACGT GGATCAGTAC ATAAAACAGC ATGCCGTCAC GATGCCGCAG CAGCATGAGC CGGGCTACCG ACTGCAGATC GACTTTGCTG GTGATCCGCT CTGGATTATC GACCCGCTTA CCAGAGAACG CATCAAGTGC CCGGTTCTGG TCTGCACGTT GCCTTGCAGC AGCTTTTTTT ACGTTGAACC GCTCTCATCT TGCAGGCAGG AGCACCTGAT TCCTGCACTC AATCGGGCGC TTGCCTATTT TGGCGGTGTT CCCAAAAACA TTCTGAGCGA CAACATGAAA CAGGTCGTGA CAACAGCATC ACGCTATGAG CCTGTTTTCA ATGATCTTAT GGAACAATGG GCCTTGCACT ATCAGACCAA CATGCAGGCA ACCAGAGCCG TCAGGCCCAA GGATAAGCCA TCTGTTGAAG GCTCGGTGCA CCATGCTTAT CAGCAGATTT ACGCAAGGTT GCGCAATGAG GAGTTCACCA GTCTGAGTGC GTTGACGTAT CGGGTTCGGC ATCTGCTTGA TACGGCCAAT GATCGGCTGA TGACCGATTA TGGCAAGAGT CGCAGACAGC GGTTTATAGA ACTTGAGCAA GAGTTTTTAC AGCCACTGCC GCTGACTGAT TTTGTGTACA AGCGTGAAAC AACTGCCAAA GTCAAGAAAA ATTATCATGT CATTCTGGGC GAAGACCGCT GCCAGTACAG TGTTCCGCAT GAGCATATCG GCAAAATCGT CAAGCTGATC TATGATGAAT CGGTGGTTGA GGTATTTCTT GATTTCCAGC GTATCGCCTT GCATCAGCGC ATCGTCGGAC GCCGGGGCAT CTACAGAACT GTCGAGGAAC ATATGCCGGA ATCACATCGC CGGTACCATC AGCAACAAGG GTGGACTGAG GAGGACTTTA CCAGCAAAGC TGCCGCTGTC GGGCCCTGTA CCGAGGAAGC TGTTTTGCGG CTTCTGAGTT CAAAAGCTTT TGCACAACAG AGCTTTGATG CCTGCCTGGG CATTCTCCGG CTCCAGAAAA AGTATGGAAC AACAAGACTC GAAGCGGCTT GCAGTGTAGC CCTGCAAGTC CCACGCCTCA ACTATCGACT CGTCAACAAC ATTCTGGAAA ACAACAGGGA CAAGGTCTCT GTTGCAGCAG GAGAACAGCG TGCATCACTG CTTCCGTTGC ATGACAATAT TCGCGGTAAA GAAGTCTACA ATTAA
Protein sequence | MANKALTMLQ VRRILKLLME ECSQREIHRS TGIHRVTIKS YLHRFTSSGK PFSELYALSD YDLSVLVHPP RSTKTSDERY ADLQPQLQRF SDELNKTNSH VTKQVLWEEY LQDRPTGYQY SQFCYHVDQY IKQHAVTMPQ QHEPGYRLQI DFAGDPLWII DPLTRERIKC PVLVCTLPCS SFFYVEPLSS CRQEHLIPAL NRALAYFGGV PKNILSDNMK QVVTTASRYE PVFNDLMEQW ALHYQTNMQA TRAVRPKDKP SVEGSVHHAY QQIYARLRNE EFTSLSALTY RVRHLLDTAN DRLMTDYGKS RRQRFIELEQ EFLQPLPLTD FVYKRETTAK VKKNYHVILG EDRCQYSVPH EHIGKIVKLI YDESVVEVFL DFQRIALHQR IVGRRGIYRT VEEHMPESHR RYHQQQGWTE EDFTSKAAAV GPCTEEAVLR LLSSKAFAQQ SFDACLGILR LQKKYGTTRL EAACSVALQV PRLNYRLVNN ILENNRDKVS VAAGEQRASL LPLHDNIRGK EVYN
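The coordinates and lengths in this record agree with each other under two common conventions, both assumed here rather than stated by the record: Start bp and End bp are 1-based and inclusive, and the unspliced CDS ends in a single stop codon that is not counted in the protein length. A minimal Python sketch of that check follows; the function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(1623620, 1625194)
print(length, protein_length_aa(length))  # prints: 1575 524
```

The results match the Gene Length (1575 bp) and Protein Length (524 aa) fields. The strand does not affect either value, because only the span between the two coordinates is used.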