Gene Caul_1744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1744 
Symbol 
ID5899199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1831687 
End bp1833966 
Gene Length2280 bp 
Protein Length759 aa 
Translation table11 
GC content68% 
IMG OID641562234 
ProductDNA topoisomerase IV subunit A 
Protein accessionYP_001683371 
Protein GI167645708 
COG category[L] Replication, recombination and repair 
COG ID[COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit 
TIGRFAM ID[TIGR01062] DNA topoisomerase IV, A subunit, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.629764 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAC CAGTCCTTCC CCCTGGCGGC CCCGGCGACG GCGATCGCAT TCTCGACGAA 
CCGCTCAGCG AAGCGCTTTC GAAGCGCTAC CTGGCCTATG CGCTGTCGAC GATCGGCTCG
CGGGCCTTGC CCGACGTGCG CGATGGCCTG AAGCCCGTGC ACCGCCGCGT GCTGCACGCG
ATGAACAACA TGCGCCTCAA CCCCGAGGGC GGCGCGCGCA AGTGCGCCAA GGTGGTCGGC
GAGGTGATGG GTAACTTCCA CCCGCACGGC GACCAGTCGA TCTATGACGC CCTGGTCCGC
CTGGCCCAGG AGTTCTCGCA GCGCGTGCCA TTGGTCGATG GCCAGGGCAA CTTCGGCAAT
ATCGACGGCG ATAGCGCCGC GGCCATGCGC TACACCGAGT GCAAGATGAC CGCCGCGGCC
GTGCTGCTGC TGGACGGCAT CGACGAGGAC GCGGTCGACT TCAAGCCGTC CTATGACGGC
CAGGACGAGG AGCCGGTGGT CCTGCCGTCG GGCTTCCCCA ACCTGCTGGC CAACGGCTCG
TCGGGCATCG CGGTGGGCAT GGCCACCTCG ATCCCGCCAC ACAATCCCGC CGAGCTGATC
GACGCCTGCC TGCTGCTGCT CAGCAAGCCC GAGGCGACGA CGGCCGAGAT CCTGGAACGC
GTGCCGGGTC CGGACTTCCC GACCGGCGGC GTGATCGTCG AGTCCCGCGA GAGCCTGCTG
GAGACCTACG AGACCGGCCG CGGCGGCGTG CGCACCCGCG CCAAGTGGGA GAAGGAAGAC
ACCGGCCGCG GGACCTACCA GATCGTCGTC ACCGAGATCC CGTACCAGGT GAAGAAGTCC
GACCTGGTCG AGCAACTGGC CGACCTGATC GACAGCAAGA AGGCCGCCCT GCTGGGCGAC
GTCCGCGACG AGAGCGCCGA GGACATCCGC CTGGTGCTGG AGCCCAAGTC CAAGAACGTC
GAGCCCGAAG TGCTGATGGA GAGCCTGTTC AAGCTCTCGG CGCTGGAAAG CCGCTTCCCG
GTCAATATCA ACGTGCTGGA CGCGCGCGGC ACCCCCGGGG TGATGGGCAT CAAGCAGGCG
CTGATCGCCT TCCTGGCCCA CCGCCGCGAC GTGCTGACCC GCCGGGCCCG CAACCGGCTG
AGCAAGATCG AAGCCCGTCT GCACATCCTG GACGGCCTGC TGATCGCCTA CCTCAATCTC
GACGAGGTGA TCCGCATCGT CCGCTACGAG GACGAGCCCA AGCAGAAGCT GATGGCCGCC
TTCGCCCTCA GCGACATCCA GGCCGACGCC ATCCTCAACA CCCGCCTGCG CCAATTGGCC
AAGCTGGAGG AGATGGAGAT CCGCCGCGAG CACGCGCAGC TGGTGGAAGA GCGCGACGGC
ATCCTGGCGA TGCTGGCCAG CGACAAGAAG CAGTGGGACC TGGTCGGCAC GGGCCTGCGC
CAGGTGCGCG CCGTGCTGCT GAAGATCAAG CACCCGCTGG ACAAGACGGG CCGCGCGACA
GGGGTCATCG GCCGCTCGGT GTTCGAGGAC GCCCCGGTGG TCGACGCCGA GGCCGCGCTC
GAGGCGCTGA TCGTGCGCGA GCCGATCACC GTCATCCTGT CGGACCGCGG CTGGATCCGC
GCCGCCAAGG GCAAGGTCGA GGACCCCTCG GAGCTGAAGT TCAAGGAAGG CGACAAGCTG
GGCTTCCTGG TCCCGGCCGA GACCACCGAC AAGCTCTTGC TGTTCACCAG CGACGGCCGG
TTCTTCACGA TCGGCTGCGC CAACCTGCCG TCGGCCCGCG GTCACGGCGA GCCGGTGCGA
ATGATGATCG AGCTGGACGA CAAGGTGAAG ATCATCGACG TCTTCCCGTT CAAGGCCGGG
CGCAAGCGCT TCCTGGCCTC CAAACAGGGC TACGGCTTCC TGATGCCGGA AGAAGAGGCC
CTGGCCAACC GCAAGGCCGG CAAGCAGGTC CTGACGGTCG ACGCGGCCGG CGCGGCCTTC
TGCCTGGAGG CCGTCGGCGA CCAGCTCGCG GTGGTCGGCG ATAACGGCAA GATCCTGATC
TTCCCGCTGG AGGAATTGCC GGAAATGCCG CGCGGCAAGG GCGTCAAGCT GCAGTCGTAT
CGTGAAGGGG GATTGCGCGA CGCCATCTCC TTCAACGCCG ACGTCGGGGC GTTCTGGATC
GACACCGCCG GCCGCCGGCG CGACTGGGTC GAGTGGCGCG ACTGGATCGG CAAGCGCGCA
GGGGCCGGCA AGCTGTCGCC CAAGGGCTTC CCGACCTCAA AGCGGTTCCG GCCCAAGTGA
 
Protein sequence
MNKPVLPPGG PGDGDRILDE PLSEALSKRY LAYALSTIGS RALPDVRDGL KPVHRRVLHA 
MNNMRLNPEG GARKCAKVVG EVMGNFHPHG DQSIYDALVR LAQEFSQRVP LVDGQGNFGN
IDGDSAAAMR YTECKMTAAA VLLLDGIDED AVDFKPSYDG QDEEPVVLPS GFPNLLANGS
SGIAVGMATS IPPHNPAELI DACLLLLSKP EATTAEILER VPGPDFPTGG VIVESRESLL
ETYETGRGGV RTRAKWEKED TGRGTYQIVV TEIPYQVKKS DLVEQLADLI DSKKAALLGD
VRDESAEDIR LVLEPKSKNV EPEVLMESLF KLSALESRFP VNINVLDARG TPGVMGIKQA
LIAFLAHRRD VLTRRARNRL SKIEARLHIL DGLLIAYLNL DEVIRIVRYE DEPKQKLMAA
FALSDIQADA ILNTRLRQLA KLEEMEIRRE HAQLVEERDG ILAMLASDKK QWDLVGTGLR
QVRAVLLKIK HPLDKTGRAT GVIGRSVFED APVVDAEAAL EALIVREPIT VILSDRGWIR
AAKGKVEDPS ELKFKEGDKL GFLVPAETTD KLLLFTSDGR FFTIGCANLP SARGHGEPVR
MMIELDDKVK IIDVFPFKAG RKRFLASKQG YGFLMPEEEA LANRKAGKQV LTVDAAGAAF
CLEAVGDQLA VVGDNGKILI FPLEELPEMP RGKGVKLQSY REGGLRDAIS FNADVGAFWI
DTAGRRRDWV EWRDWIGKRA GAGKLSPKGF PTSKRFRPK