Gene Caul_2808 details


Gene Information

Locus tag: Caul_2808
Symbol: dnaE
ID: 5900263
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3047358
End bp: 3050789
Gene Length: 3432 bp
Protein Length: 1143 aa
Translation table: 11
GC content: 67%
IMG OID: 641563300
Product: DNA polymerase III subunit alpha
Protein accession: YP_001684433
Protein GI: 167646770
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)
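
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3047358
end_bp = 3050789

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 3432 0 1143
```

Under these assumptions the computed values agree with the listed Gene Length (3432 bp) and Protein Length (1143 aa).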


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.30719
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.975511
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGGCA GCGAAGGTCA GGGTTTCGTC CACCTTCGTG TCCGTTCGGC CTACTCGCTG 
CTGGAAGGCG CGATCAAGGC CGACAAGATC CCCGGCCTGG CCGCCGCGGC CGGCATGCCG
GCGGCAGGAC TGGTCGATCG CAACAACCTG TTCGGCGCGC TCGAATATTC CGTCTATTCC
AAGGACTATG GGGTCCAGCC GATCATCGGT TGCGCCCTGG CGGTTTCAGG CGTCGGCGCC
GGACCGACCG AGCGCTGGGC GCGGACGCCG ACCATCACCC TGCTGGTTCA GAACGAGCGG
GGCTATCTCA ACCTCTCCGA ACTGTCGTCC ATGGCCTATC TGGAAAGCGG CGAGATGGCC
GAGCCGGTCG TGCCTTGGGC CAAGGTGGTC GAGCACGCCG AGGGGTTGAT CCTGCTGTCG
GGCGGAACGG ACGGACCGGT CGACGCCCTG CTGGCCGCAG GCAAGACCGC CGAGGGCGAG
GCGGCGCTGG TCGAGATGCA GCGGGCGTTC GGCGACCGCT TCTATGTCGA GCTGCAGCGC
CATGGCCTGC CGCGCCAGGC CGCCGCCGAG CCGGGTCTGG TCCACTGGGC CTATGAGCAC
GACGCGCCGC TGGTGGCCAC CAACGACGTC TACTACGCCA AGCCCGAGCT CTATGACGCC
CACGACGCCC TGCTGTGCAT TTCCGACGGC GCTTTCGTCG GCCAGGACGA ACGACGGCGG
GTGACGCCCG AGCACTGGTT CAAGTCGTCG GCCGACATGC GAAAGCTGTT CGCCGACCTG
CCGGAAGCCT GCGACAACAC CCTGGATATC GCCCGCCGCT GCGCCTTCAT GGTGCAGAAG
CGCGACCCGA TCCTGCCCAG TTTCCCGACC GGCGATGGCC GTAGCGAGCC CGAGGAACTG
ACCCACCAGG CCAAGGAGGG CCTGCGAAAG CGCCTTGATG GCCTGGAACT GGCGGTCGAC
GAGAAGGTCT ATTGGGACCG CCTCGACTTC GAGCTGTCGA TCATCATCAA GATGGGCTTT
CCCGGCTATT TCCTGATCGT GTCGGACTTC ATCAAGTGGG CCAAGGAGCA CGGCATTCCC
GTGGGGCCCG GGCGGGGTTC GGGCGCCGGG TCGCTGGTCG CCTGGGTGCT GACCATCACC
GATCTGGACC CGCTGCGCTT TGGCCTGCTG TTCGAACGGT TCCTGAACCC CGAGCGGGTC
TCCATGCCCG ACTTCGACAT CGACTTCTGC CAGGAGCGGC GGGAAGAGGT GATCGTCTAC
GTGCAGGAGA AGTACGGCCG CGATCGCGTG GCCCAGATCA TCACCTTCGG CTCCCTGCAG
GCGCGGGCCG TGCTGCGCGA CGTCGGCCGG GTGATGCAGC TGCCGCTGGG CCTGGTCGAC
CGGCTCTGCA AGATGGTGCC CAACAATCCG GCCGCGCCGG TGACCCTGGC CCAGGCCATC
GAGATCGAGC CGCGCCTCAA GCAAGCCCGT GATGAGGACG GTAACGTCAA GGCCTGCCTG
GACGTCGCCT TGCAGTTGGA GGGCCTGTTC CGCAACGCCT CCACCCACGC CGCCGGCGTG
GTCATCGGCG ACAGGCCCCT GACCCAGCTG ACGCCGCTCT ACAAGGATCC GCGCTCGGAT
CTGCCGGCCA CCCAGTTCAA CATGAAGTGG GTCGAAAGCG CCGGCCTGGT GAAGTTCGAC
TTCCTGGGCC TGAAGACCCT GACGGTGCTG GACCGGGCGG TGAAGCACCT GAAGAAGCGC
GGCGAGATCA TCGACCTCAG CCGCCTGCCG TTCGACGACA CCAAGACCTA CGAGCTTCTA
GCCTCGGGCC AAACGGTCGG CGTGTTCCAG CTGGAAAGCC AGGGCATGCG CGACACCCTG
CGCAAGATGC GCTGCGGCTC GATCGAGGAG ATCACCGCGC TGATCTCGCT GTACCGCCCG
GGGCCGATGG ACAACATCGA CACCTTCGTC GACTGCAAGT TCGATCGAAA GCCTGTCGAC
TACCTGCACC CCTCGCTGGA GGTGGTGCTG AAGGAGACCT ACGGCGTCAT CGTCTACCAG
GAACAGGTGA TGCAGATCGC CCAGATCCTG GCCGGCTACA GCCTGGGCGA AGCCGACCTG
CTGCGCCGGG CCATGGGCAA GAAGAAGAAG GAGGAAATGG ATCTCCAGAA GATCCGTTTC
GTCGCCGGCG CCAAGGAGAA GGACGTTCCC GAGGCGCAGT CGGGCTCGAT CTTCGAACTG
GTGGCCAAGT TCGCCGGCTA CGGTTTCAAC AAGTCGCACG CCGCCGCCTA CGCCCTGATC
GCCTACCAGA CGGCCTGGCT GAAGGCCAAT ACGCCGGTCG AGTTCCTGGC CGCCTCGATG
AGCCTGGACC TGTCGAACAC CGACAAGCTG GCGGTCTTCC ACCAGGACGC CCGGCGGTTC
GACATCGTCG TGCGTCCGCC GGACGTCAAT CGCTCGGGCG CCGATTTCGA GGTCGAGAAC
GGCGAGGTGC TGTACGCGCT CGGCGCCGTG CGCAACGTCG GCCTCGAAGC CATGAAGCAC
CTGGTGGCCA TCCGGGAGGA GGGCGGGCCT TTCCGCGATA TCTTCGATTT CGTCGAACGG
GTGGATCCCA AGCTGGTCAA CAAGCGAGCG ATCGAGAACC TGGCCCGGGC CGGGGCCTTC
GACTCCCTGT CCAAGAACCG CGCTCAGATC TTCGCCTCGG CCGACGTGTT GATCGCCCAT
GGCCAGAGCA TCGCCGCCGA CCGCCAGGGC GGCCAGCACG CCCTGTTCGG CGGCGACCCG
GCCGCCGGCC GTCCGCGCCT CAAGAAGACC GAGCCCTGGA GCCAGGTCGA CCTGCTCGAC
GAGGAGCTGG CCGCGGTCGG CTTCTACCTG ACTGGCCACC CGCTGGACGA CATGGTCGGG
GTGCTGCGCC GCCGGCGCAC CCACATGCTG ACCGAGGTCA TCCCGCGGGC CGAGGCTGGC
ATGGAGGCTT TCCGGATGTG CGGCGTGGTC CGCCGCCGCC AAGAACGCGC CTCGCAGAGC
GGCGAACGCT TCGCCTTCGT CTCGCTGTCG GATCCCAGCG GCGAATATGA GGTGCTGTTC
CCGCCCGAGG CCCTGCGCAA GTGCCGCGAG GTGCTGGAGC CCGGCAAGGC GGTGGCGATC
AAGGTCCGCG CCAAGGCCCG CGACGGCGAG GTGCGGTTTT TCGGCGACGA CGCCGAGCCG
ATCGAGAAGG CCATCGAGAA CATGGTGGCC GGGTTGCGCA TGCACCTGTC GCCGTCGGCC
ACCGAGATCG ACGCCTTGCG GCGCCGGCTG GAGGCGGCCG CTTCGCCGCG CGGCGGCGAG
GTCAGTCTGA TCGCCGCCCT GGGCGGCGGT CGCGAGATCG AGATGAAGCT GCCCGGCCGC
TACACCCTCG ACGCCGCCTT GCGCGGCGCC CTGAAGACCG CGCCCGGCGT GGCGCTGCTG
GAAGACGTCT AG
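
The GC content field in the record can be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted as text in 10-base blocks like those shown; `gc_content` is a hypothetical helper name, and the way the database rounds the value to a percentage is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace and any character other than A, C, G, T (for example
    ambiguity codes such as N) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G, or T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence above: 6 of 10 bases are G or C.
first_block_gc = gc_content("ATGTCGGGCA")
```

Passing the full gene sequence to the function gives the fraction for the whole gene, which can then be compared with the listed GC content.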
 
Protein sequence
MSGSEGQGFV HLRVRSAYSL LEGAIKADKI PGLAAAAGMP AAGLVDRNNL FGALEYSVYS 
KDYGVQPIIG CALAVSGVGA GPTERWARTP TITLLVQNER GYLNLSELSS MAYLESGEMA
EPVVPWAKVV EHAEGLILLS GGTDGPVDAL LAAGKTAEGE AALVEMQRAF GDRFYVELQR
HGLPRQAAAE PGLVHWAYEH DAPLVATNDV YYAKPELYDA HDALLCISDG AFVGQDERRR
VTPEHWFKSS ADMRKLFADL PEACDNTLDI ARRCAFMVQK RDPILPSFPT GDGRSEPEEL
THQAKEGLRK RLDGLELAVD EKVYWDRLDF ELSIIIKMGF PGYFLIVSDF IKWAKEHGIP
VGPGRGSGAG SLVAWVLTIT DLDPLRFGLL FERFLNPERV SMPDFDIDFC QERREEVIVY
VQEKYGRDRV AQIITFGSLQ ARAVLRDVGR VMQLPLGLVD RLCKMVPNNP AAPVTLAQAI
EIEPRLKQAR DEDGNVKACL DVALQLEGLF RNASTHAAGV VIGDRPLTQL TPLYKDPRSD
LPATQFNMKW VESAGLVKFD FLGLKTLTVL DRAVKHLKKR GEIIDLSRLP FDDTKTYELL
ASGQTVGVFQ LESQGMRDTL RKMRCGSIEE ITALISLYRP GPMDNIDTFV DCKFDRKPVD
YLHPSLEVVL KETYGVIVYQ EQVMQIAQIL AGYSLGEADL LRRAMGKKKK EEMDLQKIRF
VAGAKEKDVP EAQSGSIFEL VAKFAGYGFN KSHAAAYALI AYQTAWLKAN TPVEFLAASM
SLDLSNTDKL AVFHQDARRF DIVVRPPDVN RSGADFEVEN GEVLYALGAV RNVGLEAMKH
LVAIREEGGP FRDIFDFVER VDPKLVNKRA IENLARAGAF DSLSKNRAQI FASADVLIAH
GQSIAADRQG GQHALFGGDP AAGRPRLKKT EPWSQVDLLD EELAAVGFYL TGHPLDDMVG
VLRRRRTHML TEVIPRAEAG MEAFRMCGVV RRRQERASQS GERFAFVSLS DPSGEYEVLF
PPEALRKCRE VLEPGKAVAI KVRAKARDGE VRFFGDDAEP IEKAIENMVA GLRMHLSPSA
TEIDALRRRL EAAASPRGGE VSLIAALGGG REIEMKLPGR YTLDAALRGA LKTAPGVALL
EDV
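
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). `CODON_TABLE` and `translate` are illustrative names, not part of the record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so 10-base blocks can be pasted directly.
    A codon containing anything other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
n_terminus = translate("ATGTCGGGCA GCGAAGGTCA GGGTTTCGTC")  # "MSGSEGQGFV"
c_terminus = translate("GAAGACGTCT AG")                     # "EDV"
```

Both fragments match the corresponding ends of the protein sequence: the first block `MSGSEGQGFV` and the final residues `EDV`.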