Gene Information |
Locus tag | Acid345_2069 |
Symbol | |
ID | 4069919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2479770 |
End bp | 2480849 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984083 |
Product | phage integrase |
Protein accession | YP_591144 |
Protein GI | 94969096 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
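The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the helper name `feature_lengths` is hypothetical, not part of any database API):

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = feature_lengths(2479770, 2480849)
print(gene_len, protein_len)  # 1080 359
```

The result agrees with the Gene Length (1080 bp) and Protein Length (359 aa) rows of the record.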
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.244007 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000737798 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTAGAA GAAAAACGCC TGACCGCGAC GGCGTGTACC AGCGCAAGGA CCGTCCGGGC CAGTTTTGGG CAAGCTGGAC TGATCAAAAT GGCAAGCGCC GGCAGCGGCG ACTACGCGGC GTCTTCACTC TGACGCAGGC GAAAGAGTTG CTCGCTGCTG CGCAGCTGCG GGTGGAGCGG ATCAAGGTGC TTGGCTTCGC GGAACCCACA AAGGAACGTT TCGAAAAAAT CGAAGCTCGC TTTCTGCTGC ACCAACGCGC GCGCCTGACG AAAGCAGGCT ACGAACGTGA GCGTGGCGTC GTCGAAACAC ATCTGCACGA TTACTTCGGG CAGATGCAAC TGGCCCTTAT CCGACGTGGC GACATCGCAA ATTTCGTCAC GAAGCGGGCG GCCGAAGTCT CGCCGGCTAC GGTCGCGAAA GAGCTTGTGA CGCTCAAGCA TCTCTTCCGT CTTTGCGTCG AATGGGAGTT GCTGGTGCTA AATCCCGCAA CCGGTGTCAA GGCCCCGAAG GTTGCGCCTG GCCGTGTGCG CTGGCTGCAA CCCGAAGAGC TGCGCGCCCT GTTGCAAGCC TGCCCTGATT GGTTGCGCCC GATCGCAGGT CTCGCCGCGG CTACAGGGAT GCGGCGAAGC GAGGTGCTCG GCGTGCGATG GTTCGACGTG GACCGCAGGA ACGGGTGTCT CACGCTGCGT CAAACCAAGA ACAACGAAAG CCGCATCGTG TTTCTCAATC AATCTGCGCT GCACGTCATC GACTCGCTTC GAAAGGGGAA GTCAAGCGAG AAGCTCTTCC CGAAGGTCAC GCCGGAACAA GTGAGCATGG GATTCCTTCG GACCTGCCGT GAGCTTGGTA TCGAGGATTT TCGATGGCAC GATTTGCGGC ACTGCTTCGC CTCACTACTG CGTCAGTCGG GCGCCGACTT GCAGGATGTA GCCGAGTTGC TTGGTCATAA GGATCTCCGA ATGACGAAGC GATACTCCCA TCTCTCGCCA GCCCACCTAT CAGCGGCGGC CAGGCGATTC GATGCCGTGC TCGGGGAGCC CTTGTCGCTT GCTGACCCGA GCGGAGGGCG GGGAGCATAG
|
Protein sequence | MPRRKTPDRD GVYQRKDRPG QFWASWTDQN GKRRQRRLRG VFTLTQAKEL LAAAQLRVER IKVLGFAEPT KERFEKIEAR FLLHQRARLT KAGYERERGV VETHLHDYFG QMQLALIRRG DIANFVTKRA AEVSPATVAK ELVTLKHLFR LCVEWELLVL NPATGVKAPK VAPGRVRWLQ PEELRALLQA CPDWLRPIAG LAAATGMRRS EVLGVRWFDV DRRNGCLTLR QTKNNESRIV FLNQSALHVI DSLRKGKSSE KLFPKVTPEQ VSMGFLRTCR ELGIEDFRWH DLRHCFASLL RQSGADLQDV AELLGHKDLR MTKRYSHLSP AHLSAAARRF DAVLGEPLSL ADPSGGRGA
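The sequences above are printed in space-separated blocks of ten. A minimal sketch of how such a record could be checked follows, assuming the standard codon assignments used by translation table 11 and ignoring its alternative start codons (this gene starts with ATG, so that simplification does not matter here). The function names are hypothetical helpers, not an existing library API.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (NCBI translation table 11
# assigns the same amino acids to codons as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGCCTAGAA GAAAAACGCC TGACCGCGAC"
print(translate(first_blocks))  # MPRRKTPDRD
```

Passing the full gene sequence to `translate` and `gc_content` should let the Protein sequence and GC content rows be compared against the record in the same way.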