Gene Caul_1505 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1505 
Symbol 
ID5898960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1600098 
End bp1602788 
Gene Length2691 bp 
Protein Length896 aa 
Translation table11 
GC content68% 
IMG OID641561992 
ProductDNA topoisomerase I 
Protein accessionYP_001683133 
Protein GI167645470 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.143439 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGTCG TCGTCGTCGA GAGCCCGGCC AAGGCCAAGA CCATCAACAA GTACCTCGGG 
TCCGGCTACA CGGTTCTCGC CTCGTACGGC CACATCCGCG ACCTGCCGTC CAAGGACGGC
TCGGTCGAGC CCGATAACGA CTTCGCCATG CACTGGGAGG CCGACGCCAA GGGCGCCAAG
CGGATCGGCG ACATCGTCGA CGCCATGAAG GGCGCCGACG GCGTCATCCT GGCCACCGAC
CCCGACCGCG AAGGCGAAGC GATCAGCTGG CACGTGCTGG AGGTGCTGCA GAAGAAGAAG
GCGATCAAGG ACAAGTCGGT CCAGCGCGTC ACCTTCAACG CCATCACCAA GACCTCGGTG
CTCGAGGCCA TGGCCCATCC GCGCGACATC GACATGGAGC TGGTCGAGGC CTATCTGGCC
CGCCGCGCCC TGGACTATCT GGTCGGCTTC ACCCTGTCGC CGGTGCTGTG GCGCAAGCTG
CCGGGCAGCC GCTCGGCCGG CCGCGTGCAA TCGGTCTGCC TGCGGCTGAT CGTCGACCGC
GAGCTGGAGA TCGAGCGCTT CAAGACCCAG GAATACTGGA GCGTCGAGGC CGACGTCACG
GCCGGCGCCG AGCCGTTCGT CGCCCGCCTG GTCAAGCACG AGAACAAGAA GCTCACCAAG
TTCGACCTGA ATAACGAGAG CTCGGCCCTG GCCGCCAAGG CCGCTGTCGA GAAGGCCGTG
TTCAAGGTCG CCGCCGTCGA GAAGAAGCCC GGCAAGCGCT CGCCCGCCCC GCCCTTCACC
ACCTCCACCC TGCAGCAGGA AGCCTCGCGC AAGCTGGGCT TTTCCGCCCA ACGCACCATG
CAGGCCGCGC AGAAGCTGTA TGAAGGCATC GACATCGGCG GCGAGACCGT CGGTCTGATC
ACCTACATGC GGACCGACGG CGTGTCGGTC GAACCCGAAG GCATCGCCGA GGCGCGCAGC
GTGATCGGCA GCGTCTATGG CGAGACCTAC GTCCCCGAGA CCCCGCGCTA CTACAAGGCC
AAGGCCAAGA ACGCCCAGGA GGCCCACGAA GCCATCCGGC CGACCAGCCT GAAGCGCAAC
CCGGGCTCGC TGCGCCTGGA GTCGGACCTG GGCCGCCTGT ACGAGCTGAT CTGGAAGCGG
ATGATCGCCT CGCAGATGGA GAGCGCCCGC ATCGAGCGCA CCACCGTCGA CCTGGAAAGC
GCCGATGGCC AAACCGGGAT GCGCGCCACG GGGCAGGTCG TGCTGTTCCC CGGCTATCTG
GCCGTCTACG AGGAAGGCCG CGACGACGAG GGCGACGAGG ACAGCGCCCG CCTGCCGATG
ATCGAGGAAG GCGCCGCCGC CAAGGTGCTC GACGCCCGCG CCGACCAGCA CTTCACCGAG
CCGCCCCCGC GCTATTCGGA AGCCAGCCTC GTCAAGAAGA TGGAAGAGCT GGGCATCGGC
CGCCCGTCGA CCTACGCCTC GGTGCTGACC GTGCTGCGCG ACCGCGAATA TGTCCGCATG
GACAAGCAGC GGTTCATCCC CGAGGACAAG GGCCGTCTGG TCACCGCCTT CCTGGAGCAG
TTCTTCCGCC GCTACGTGGA GTACGACTTC ACCGCCGCCC TGGAAGAACA GCTGGACCTG
GTGTCGGACG GCAAGCTGGA CTGGAAGCAG TTCCTCCGCG ACTTCTGGAA GGACTTCCAC
GCCGCCGTCG GCGAGATCGC CGAGCTGCGC ACCACCAACG TGCTGGACGC TCTGAACGAG
TCGCTGGGCC CGCACATCTT CCCCGACAAG GGCGACGGGT CCGACCCGCG CCTGTGCCCG
ACCTGCGGCA CGGGACAGCT GTCGCTGAAA GTCGGCAAGT TCGGGGCCTT CATCGGCTGC
TCGAACTATC CCGAATGCCG CTTCACCCGC CAGTTGGCCA CCGCCGAGGG CGAAGGCGAG
GCGGAGGCCG CCGACAAGGA GTTGGGGATC AACCCGGCGA CCGGTCGCGC GGTGTGGCTG
AAGAACGGCC GCTTCGGTCC CTACGTCGAG GAGCCGGCGG CGGAGGGCAG CGGCGACAAG
CCCAAGCGCT CCAGCCTGCC CAAGGGCTGG ACGCCGGCCG GGCTGGACCT GGAAAAGGCC
CTGCGCCTGC TCGCCCTGCC CCGCGAGGTC GGGATGCACC CCGACGACGG CAAGAAGATC
ACCGCCGGCC TCGGCCGCTT CGGACCGTTC GTGCTGCACG AGGGCACCTA CGCCAATCTC
GAGAACCCGG AAGAGGTGTT CGACATCGGC CTGAACCGCG CGGTCGCTTT GCTGGCCGAC
AAGCGGGCCG GCGGCGGACG CCCGCAGCGG GGACAAGCGG CGGCGCTGGC CGACCTGGGC
GTCCACCCCG AGGACGGCAA GCCGGTGAAG GTCCTGTCGG GGCGCTTCGG GCCCTACATT
AAGCACGGCG ACACCAACGC CAATGTCCCC AAGGGCGCCG ATCCCGCCGC CCTGACCCTG
GCCGAGGCCG TGGTCCTGCT GGCCGACCGC GTCGCCAAGG GCGGCGGCAA GAAGCCGGCG
AAGAAGGCCG CCGCCAAGAA GGCTCCGGCG AAAGCCAAGG CCGCCGCCGC GACCGACGGC
GGAGCTCCAG CGAAGAAGGC TCCCGCCAAG AAGGCTCCGG CCAAGGCGAC TGGCGCCAAG
AAAACGGCGG CGAAGAAGCC CGCCGCGAAG AAGGCCAAAG CCGAGGCGTG A
 
Protein sequence
MIVVVVESPA KAKTINKYLG SGYTVLASYG HIRDLPSKDG SVEPDNDFAM HWEADAKGAK 
RIGDIVDAMK GADGVILATD PDREGEAISW HVLEVLQKKK AIKDKSVQRV TFNAITKTSV
LEAMAHPRDI DMELVEAYLA RRALDYLVGF TLSPVLWRKL PGSRSAGRVQ SVCLRLIVDR
ELEIERFKTQ EYWSVEADVT AGAEPFVARL VKHENKKLTK FDLNNESSAL AAKAAVEKAV
FKVAAVEKKP GKRSPAPPFT TSTLQQEASR KLGFSAQRTM QAAQKLYEGI DIGGETVGLI
TYMRTDGVSV EPEGIAEARS VIGSVYGETY VPETPRYYKA KAKNAQEAHE AIRPTSLKRN
PGSLRLESDL GRLYELIWKR MIASQMESAR IERTTVDLES ADGQTGMRAT GQVVLFPGYL
AVYEEGRDDE GDEDSARLPM IEEGAAAKVL DARADQHFTE PPPRYSEASL VKKMEELGIG
RPSTYASVLT VLRDREYVRM DKQRFIPEDK GRLVTAFLEQ FFRRYVEYDF TAALEEQLDL
VSDGKLDWKQ FLRDFWKDFH AAVGEIAELR TTNVLDALNE SLGPHIFPDK GDGSDPRLCP
TCGTGQLSLK VGKFGAFIGC SNYPECRFTR QLATAEGEGE AEAADKELGI NPATGRAVWL
KNGRFGPYVE EPAAEGSGDK PKRSSLPKGW TPAGLDLEKA LRLLALPREV GMHPDDGKKI
TAGLGRFGPF VLHEGTYANL ENPEEVFDIG LNRAVALLAD KRAGGGRPQR GQAAALADLG
VHPEDGKPVK VLSGRFGPYI KHGDTNANVP KGADPAALTL AEAVVLLADR VAKGGGKKPA
KKAAAKKAPA KAKAAAATDG GAPAKKAPAK KAPAKATGAK KTAAKKPAAK KAKAEA