Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1505 |
Symbol | |
ID | 5898960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1600098 |
End bp | 1602788 |
Gene Length | 2691 bp |
Protein Length | 896 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641561992 |
Product | DNA topoisomerase I |
Protein accession | YP_001683133 |
Protein GI | 167645470 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.143439 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGTCG TCGTCGTCGA GAGCCCGGCC AAGGCCAAGA CCATCAACAA GTACCTCGGG TCCGGCTACA CGGTTCTCGC CTCGTACGGC CACATCCGCG ACCTGCCGTC CAAGGACGGC TCGGTCGAGC CCGATAACGA CTTCGCCATG CACTGGGAGG CCGACGCCAA GGGCGCCAAG CGGATCGGCG ACATCGTCGA CGCCATGAAG GGCGCCGACG GCGTCATCCT GGCCACCGAC CCCGACCGCG AAGGCGAAGC GATCAGCTGG CACGTGCTGG AGGTGCTGCA GAAGAAGAAG GCGATCAAGG ACAAGTCGGT CCAGCGCGTC ACCTTCAACG CCATCACCAA GACCTCGGTG CTCGAGGCCA TGGCCCATCC GCGCGACATC GACATGGAGC TGGTCGAGGC CTATCTGGCC CGCCGCGCCC TGGACTATCT GGTCGGCTTC ACCCTGTCGC CGGTGCTGTG GCGCAAGCTG CCGGGCAGCC GCTCGGCCGG CCGCGTGCAA TCGGTCTGCC TGCGGCTGAT CGTCGACCGC GAGCTGGAGA TCGAGCGCTT CAAGACCCAG GAATACTGGA GCGTCGAGGC CGACGTCACG GCCGGCGCCG AGCCGTTCGT CGCCCGCCTG GTCAAGCACG AGAACAAGAA GCTCACCAAG TTCGACCTGA ATAACGAGAG CTCGGCCCTG GCCGCCAAGG CCGCTGTCGA GAAGGCCGTG TTCAAGGTCG CCGCCGTCGA GAAGAAGCCC GGCAAGCGCT CGCCCGCCCC GCCCTTCACC ACCTCCACCC TGCAGCAGGA AGCCTCGCGC AAGCTGGGCT TTTCCGCCCA ACGCACCATG CAGGCCGCGC AGAAGCTGTA TGAAGGCATC GACATCGGCG GCGAGACCGT CGGTCTGATC ACCTACATGC GGACCGACGG CGTGTCGGTC GAACCCGAAG GCATCGCCGA GGCGCGCAGC GTGATCGGCA GCGTCTATGG CGAGACCTAC GTCCCCGAGA CCCCGCGCTA CTACAAGGCC AAGGCCAAGA ACGCCCAGGA GGCCCACGAA GCCATCCGGC CGACCAGCCT GAAGCGCAAC CCGGGCTCGC TGCGCCTGGA GTCGGACCTG GGCCGCCTGT ACGAGCTGAT CTGGAAGCGG ATGATCGCCT CGCAGATGGA GAGCGCCCGC ATCGAGCGCA CCACCGTCGA CCTGGAAAGC GCCGATGGCC AAACCGGGAT GCGCGCCACG GGGCAGGTCG TGCTGTTCCC CGGCTATCTG GCCGTCTACG AGGAAGGCCG CGACGACGAG GGCGACGAGG ACAGCGCCCG CCTGCCGATG ATCGAGGAAG GCGCCGCCGC CAAGGTGCTC GACGCCCGCG CCGACCAGCA CTTCACCGAG CCGCCCCCGC GCTATTCGGA AGCCAGCCTC GTCAAGAAGA TGGAAGAGCT GGGCATCGGC CGCCCGTCGA CCTACGCCTC GGTGCTGACC GTGCTGCGCG ACCGCGAATA TGTCCGCATG GACAAGCAGC GGTTCATCCC CGAGGACAAG GGCCGTCTGG TCACCGCCTT CCTGGAGCAG TTCTTCCGCC GCTACGTGGA GTACGACTTC ACCGCCGCCC TGGAAGAACA GCTGGACCTG GTGTCGGACG GCAAGCTGGA CTGGAAGCAG TTCCTCCGCG ACTTCTGGAA GGACTTCCAC GCCGCCGTCG GCGAGATCGC CGAGCTGCGC ACCACCAACG TGCTGGACGC TCTGAACGAG TCGCTGGGCC CGCACATCTT CCCCGACAAG GGCGACGGGT CCGACCCGCG CCTGTGCCCG ACCTGCGGCA CGGGACAGCT GTCGCTGAAA GTCGGCAAGT TCGGGGCCTT CATCGGCTGC TCGAACTATC CCGAATGCCG CTTCACCCGC CAGTTGGCCA CCGCCGAGGG CGAAGGCGAG GCGGAGGCCG CCGACAAGGA GTTGGGGATC AACCCGGCGA CCGGTCGCGC GGTGTGGCTG AAGAACGGCC GCTTCGGTCC CTACGTCGAG GAGCCGGCGG CGGAGGGCAG CGGCGACAAG CCCAAGCGCT CCAGCCTGCC CAAGGGCTGG ACGCCGGCCG GGCTGGACCT GGAAAAGGCC CTGCGCCTGC TCGCCCTGCC CCGCGAGGTC GGGATGCACC CCGACGACGG CAAGAAGATC ACCGCCGGCC TCGGCCGCTT CGGACCGTTC GTGCTGCACG AGGGCACCTA CGCCAATCTC GAGAACCCGG AAGAGGTGTT CGACATCGGC CTGAACCGCG CGGTCGCTTT GCTGGCCGAC AAGCGGGCCG GCGGCGGACG CCCGCAGCGG GGACAAGCGG CGGCGCTGGC CGACCTGGGC GTCCACCCCG AGGACGGCAA GCCGGTGAAG GTCCTGTCGG GGCGCTTCGG GCCCTACATT AAGCACGGCG ACACCAACGC CAATGTCCCC AAGGGCGCCG ATCCCGCCGC CCTGACCCTG GCCGAGGCCG TGGTCCTGCT GGCCGACCGC GTCGCCAAGG GCGGCGGCAA GAAGCCGGCG AAGAAGGCCG CCGCCAAGAA GGCTCCGGCG AAAGCCAAGG CCGCCGCCGC GACCGACGGC GGAGCTCCAG CGAAGAAGGC TCCCGCCAAG AAGGCTCCGG CCAAGGCGAC TGGCGCCAAG AAAACGGCGG CGAAGAAGCC CGCCGCGAAG AAGGCCAAAG CCGAGGCGTG A
|
Protein sequence | MIVVVVESPA KAKTINKYLG SGYTVLASYG HIRDLPSKDG SVEPDNDFAM HWEADAKGAK RIGDIVDAMK GADGVILATD PDREGEAISW HVLEVLQKKK AIKDKSVQRV TFNAITKTSV LEAMAHPRDI DMELVEAYLA RRALDYLVGF TLSPVLWRKL PGSRSAGRVQ SVCLRLIVDR ELEIERFKTQ EYWSVEADVT AGAEPFVARL VKHENKKLTK FDLNNESSAL AAKAAVEKAV FKVAAVEKKP GKRSPAPPFT TSTLQQEASR KLGFSAQRTM QAAQKLYEGI DIGGETVGLI TYMRTDGVSV EPEGIAEARS VIGSVYGETY VPETPRYYKA KAKNAQEAHE AIRPTSLKRN PGSLRLESDL GRLYELIWKR MIASQMESAR IERTTVDLES ADGQTGMRAT GQVVLFPGYL AVYEEGRDDE GDEDSARLPM IEEGAAAKVL DARADQHFTE PPPRYSEASL VKKMEELGIG RPSTYASVLT VLRDREYVRM DKQRFIPEDK GRLVTAFLEQ FFRRYVEYDF TAALEEQLDL VSDGKLDWKQ FLRDFWKDFH AAVGEIAELR TTNVLDALNE SLGPHIFPDK GDGSDPRLCP TCGTGQLSLK VGKFGAFIGC SNYPECRFTR QLATAEGEGE AEAADKELGI NPATGRAVWL KNGRFGPYVE EPAAEGSGDK PKRSSLPKGW TPAGLDLEKA LRLLALPREV GMHPDDGKKI TAGLGRFGPF VLHEGTYANL ENPEEVFDIG LNRAVALLAD KRAGGGRPQR GQAAALADLG VHPEDGKPVK VLSGRFGPYI KHGDTNANVP KGADPAALTL AEAVVLLADR VAKGGGKKPA KKAAAKKAPA KAKAAAATDG GAPAKKAPAK KAPAKATGAK KTAAKKPAAK KAKAEA
|
| |