Gene Information |
Locus tag | Acid345_3389 |
Symbol | |
ID | 4072725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4010965 |
End bp | 4013034 |
Gene Length | 2070 bp |
Protein Length | 689 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637985411 |
Product | integrase catalytic subunit |
Protein accession | YP_592464 |
Protein GI | 94970416 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
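The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4010965, 4013034)
print(length_bp)                  # 2070
print(protein_length(length_bp))  # 689
```

Both results agree with the Gene Length and Protein Length fields listed for Acid345_3389.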
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACC TCTGGCTGAT CGAAGCTGAT GTGCTCGCCG CCATCTCGTT CAGTGAACGC CATCTCCGCC GTCTCGCCCG CGAAGGCAAG GTCATCAGCC GCGCGTCCGC TACTAAAGCC GCCAATGGCC GCTTCGTTCG CGAGTACTGC ATCGAGAGTT TTCCCACCGA TCTTCGCGAG AGGCTGCTCG CCGTGCGAGG TGCGATCACT CTCCTCCCAC AGAACAGCGC AACCGCCGCT GAGCAGCCCC TCCTGAAGAT CGCCTGCGTT GACGAGGAAG AAGAAGCCCA GGCCGCGCGG CGCGAAGCTG CCTGCCTCGC CATCGCCAAC TTCGAACAGC ACAAGGCGCA ATGGGCTACC GTGCGCCTAG CGGATGGCGA GCCTGTCACT TCCATAAGCC GCCTGGTCGA GCATCTCGCG GCGAAGTATT CATCCAGCCC CGCGACAATC TGGCGCTGGC ACAGCCGCTT CAAGAAGGGC GAACGACTCG CCGACCGGAC CCGCGAAGAC AAAGGACGCT CGCGCTGGTT TGCTCGGCAC GGCGACGCCG CAGCAGCCGC TGCCTATATC GGGTTGAAGT GGGGCGCGCG TGAAGCGCAT CGCTCGCTGC TCTCTCACCA TGAGCTCCTC GGCATCGCGA AAGAGGAGAT GCCGTCTTAC GAGACGGTGC GCTCGTTCCT GAATTCCGCG CCGCCGGCGA TGAGCATTCT GGTGCGCGAT GGCGAGCGCC GCTACCGCGA CCTGATCTCG CCCTACGTCC GTCGCGGCTA CAACGAATAC GCAAATCAAA TCTGGGTCAG CGATCACATG ATCCACGATC TCTTCGCGCA GAACGATGTC TTTGACGATA TCCCGCGCGG CCAGCGTATT CGCATGCGCC TTACCGCGCT GCTCGATTTC CGCGCCCGCT ACGTCGTTGG TTACAGCTTC GCCGAAGAAG GCAGCTCAAT CTCCATCACA ACCTGCCTAC GCCAGGCGAT CGCCAGCTAT GGCGCATGCG AAGAGTTCTA TTGCGACAAC GGCAAGGACT ACAAATCCGT TGCCAAGGCC GCGCTGCCCG CGTATCTGCG CGATTCCGGT AAGGCCCCAC AGGACTGGTG GCAGCAGGAG CTCGACACGC AGGCCGGCGT GCTTGCGCGC TGCGGGATCT CGATCCGCCA TTGCATCGTG CGGCATCCGC AGTCGAAACA CGTTGAACGC TTCTTCCGCA CCGTGCACAA ACAGTTCGAC GCGCTCTTCC CAACTTATAG CGGCTCGAAT CCAGATCGCC GTCCTGAGTT CACATCGAAG GCCATCGCCG AGCACAGCCG TCTGGAGCGT GAGAGCGCGC GACTGATCCA GATGGGCAAA GGCATCAACG GTCTGCACCA ATCGCTCTTG CCGCCAGCAA CGCTGGTGAT GAAGCTCTTC CGCGCCTGGC TTGACGAGTA CCACAACACG CCGCATGGCG GTCAGGGTAT GGACGGTCGC ACACCGGCGC AGGTCTTCGA GCAGGAGCGC AACCCGTTGC AACGCCCCGC GCCGGCCGAC AACGTTCTGG CCCTCATGTT GTGTTCGCGG GAGCAGCGCA TGGTGCGCGA GTGCTCGGTC ACCGTGGGCA AACGGCGCTT CATCGGCGCG GATTTCACCG CGGTGAAGCG ACTGCACGAC GTGAGCAACT GCGAAGTGAT GGTGGCCTAC GACCCGCTCG ATCTCGATCG CGTGGCGGTC CTCGATCTCG ACGGCAACCT CATTTGCTGG GCCAAGCCCG AAGAGTTCCT CCCCCAGAAC ACCACCCAGG CGGCGAACGC GATTGCGGAA 
AGCATGCAGC AGCGCCGCCG CCTGGAGCGC AACACACGCG ATGCCTACGT GGCCATGCGC GATGCCGCCC GCGCCTCCGG TGTCGTCACC GGCGTGGAGC GGTTGGTGAA CAAGGTGCTC GCGCTGCCAC CCGCCAGCGA GGTGCCGGCA GTGCAACGCA GTTCAATGGC GCGCGCGGCC AAGGCAGCAG CAGCGGCCGC ACCCAAGGTG AGCCAGAAGT ATGTAGGCGA CGTAGCCGAA GAGATCGCCG GATTGATGGA GGGGGACTGA
|
Protein sequence | MSDLWLIEAD VLAAISFSER HLRRLAREGK VISRASATKA ANGRFVREYC IESFPTDLRE RLLAVRGAIT LLPQNSATAA EQPLLKIACV DEEEEAQAAR REAACLAIAN FEQHKAQWAT VRLADGEPVT SISRLVEHLA AKYSSSPATI WRWHSRFKKG ERLADRTRED KGRSRWFARH GDAAAAAAYI GLKWGAREAH RSLLSHHELL GIAKEEMPSY ETVRSFLNSA PPAMSILVRD GERRYRDLIS PYVRRGYNEY ANQIWVSDHM IHDLFAQNDV FDDIPRGQRI RMRLTALLDF RARYVVGYSF AEEGSSISIT TCLRQAIASY GACEEFYCDN GKDYKSVAKA ALPAYLRDSG KAPQDWWQQE LDTQAGVLAR CGISIRHCIV RHPQSKHVER FFRTVHKQFD ALFPTYSGSN PDRRPEFTSK AIAEHSRLER ESARLIQMGK GINGLHQSLL PPATLVMKLF RAWLDEYHNT PHGGQGMDGR TPAQVFEQER NPLQRPAPAD NVLALMLCSR EQRMVRECSV TVGKRRFIGA DFTAVKRLHD VSNCEVMVAY DPLDLDRVAV LDLDGNLICW AKPEEFLPQN TTQAANAIAE SMQQRRRLER NTRDAYVAMR DAARASGVVT GVERLVNKVL ALPPASEVPA VQRSSMARAA KAAAAAAPKV SQKYVGDVAE EIAGLMEGD
|
| |
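As a consistency check between the two sequences, the gene sequence can be translated codon by codon and compared with the protein sequence. The sketch below is a minimal illustration, not the database's own method. It builds the codon table by hand rather than using a library, and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The listed gene sequence begins with ATG and ends with TGA, so it appears to be given in the coding orientation even though the gene lies on the minus strand. The function and variable names are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is removed first, so the space-separated 10-base blocks used
    in the record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGCGACC TCTGGCTGAT CGAAGCTGAT"))  # MSDLWLIEAD
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function would allow the whole 689-residue translation to be compared against the record.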