Gene Caul_4462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4462 
Symbol 
ID5901923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4834076 
End bp4835653 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content73% 
IMG OID641564981 
Producthypothetical protein 
Protein accessionYP_001686080 
Protein GI167648417 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.234838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGCA TCCTTTCCGT CTGGTGTCGC AATTGGCCGA TCACGACGTG GCGGCGAGCG 
AACCCGAGCT TCGGCTCGTC GGCTGAGGTC AAAGCCTCCC CCCATGGGGG AGGCTTTATA
CTCCCGCCCC TCGCCCTGGT CGCCACCGAG GGCGGAACCC GCCGCCTGGC CGCCGTCGAC
GACGCGGCCG CCGCTCTGGG CCTGCACGTC GGCCAGAAGA CCGCCGACGC CGCGGCCCTG
GTTCCGGGCC TGGTCACCGC CGACCATGAC CCCGAGGGCG ACCGCGCGGC GCTGGAGATC
CTCTGCGACT GGTGCGTGCG CTTCTCGCCG GCCGTGGCCA TCGACGGGCT GGACGGCCTG
TTCCTCGACG TCGAGGGCGT CTCGCACCTG TGGGGCGGGG AGGCGGCGAT GCTCGACGAC
CTGCTGGCCC GGCTGGAGCG CTGGGGCGCG CCGACGAGGG GCGCGATCGC CGACACCCCC
GGCGCGGCCT GGGCCCTGGC CCGCTACGCG CCAGATCGCA CCATAGCCTC GCCCGGCGGC
CAAGGCCCGC TGCTGGCCCC GCTGCCAGTC GCAGCCCTGC GGCTGGACGA GGCGGGCCAG
GCCCAGCTGC CGCGCCTGGG CCTGTTCCAT GTCGGCCAAT TGCTGGCCCT GCCGCGCGCC
CAGTTGGCCA AGCGCTTCGG CCTGAGCGCG GTGCTACGCA TCGACCAGGC CCTGGGCGCG
GCCCGCGAGG CCCTGACCTT CCGGCGTCCC GCCACCCCGT GGTTCGACCG CCTGGCCTTC
TTCGAGCCGA TCAGCGCCCT GGAGGACCTG GAGCGGGTGA CGGGCGACAT CTGCGCCCTG
CTCTGCGCCC GGCTGGAGGC CGAGGGCCAG GGCGCGCGGC GGTTCGAGCT GGTCTTCCAC
CGCCTGGACG GCCGCGACTA TCCGCTGCGC GTGGGCCTGT CGCGTCCCGG CCGCGACGCC
GCCCGCGTCG CCAAGCTGCT GAAGCCGAAA CTGGAAACGG TCGATCCCGG CTTCGGGATC
GAGGTGGTCA CCCTGTGGGC CGCCGATGTC GAGCCGCTGT CCACCGCGCA AAGAAATCTA
GGAGGGGGCA GCCTGGACGC CGACGGCGGG GTCAGCCTGG AGGAGGGCCT GGCGCCGCTG
ATCGACCGGC TGGTCAACCG TCTGGGCGAG GACCGGGTCT GGCGGGCTGA TCCCCATGAG
AGCCACGTGC CCGAGCGTTC GGTGACGCGC GCCGCCCCGC TGGATCCGGC GCCGGAAGCG
GCCTGGGACC CCGAGCGGCC GCGACCGACC CGGCTGCTGC GCCGCCCTGA GGCGATCACG
GTCATGGCCC AGTTGCCCGA CGAACCGCCG GCCCATTTCA CCTGGCGAGG CCAGCGCCAT
CGCGTGCGTC ATGCCGAGGG ACCCGAGCGG ATCGGCCAGG AGTGGTGGCG CAAGGCCTTC
GACGGCGTCG GGCCGAGCAA GATCCGCGAC TATTACCGGG TCGAGGACGA GGCCGGCGGC
CGGTTCTGGA TCTACCGCCA GGGCCTCTAC GGCGTGGGCG ACGAACCGAA GTGGTGGTTG
CATGGCCTGT TTGGGTAG
 
Protein sequence
MARILSVWCR NWPITTWRRA NPSFGSSAEV KASPHGGGFI LPPLALVATE GGTRRLAAVD 
DAAAALGLHV GQKTADAAAL VPGLVTADHD PEGDRAALEI LCDWCVRFSP AVAIDGLDGL
FLDVEGVSHL WGGEAAMLDD LLARLERWGA PTRGAIADTP GAAWALARYA PDRTIASPGG
QGPLLAPLPV AALRLDEAGQ AQLPRLGLFH VGQLLALPRA QLAKRFGLSA VLRIDQALGA
AREALTFRRP ATPWFDRLAF FEPISALEDL ERVTGDICAL LCARLEAEGQ GARRFELVFH
RLDGRDYPLR VGLSRPGRDA ARVAKLLKPK LETVDPGFGI EVVTLWAADV EPLSTAQRNL
GGGSLDADGG VSLEEGLAPL IDRLVNRLGE DRVWRADPHE SHVPERSVTR AAPLDPAPEA
AWDPERPRPT RLLRRPEAIT VMAQLPDEPP AHFTWRGQRH RVRHAEGPER IGQEWWRKAF
DGVGPSKIRD YYRVEDEAGG RFWIYRQGLY GVGDEPKWWL HGLFG