Gene Information |
Locus tag | Caul_4462 |
Symbol | |
ID | 5901923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4834076 |
End bp | 4835653 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641564981 |
Product | hypothetical protein |
Protein accession | YP_001686080 |
Protein GI | 167648417 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.234838 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGCA TCCTTTCCGT CTGGTGTCGC AATTGGCCGA TCACGACGTG GCGGCGAGCG AACCCGAGCT TCGGCTCGTC GGCTGAGGTC AAAGCCTCCC CCCATGGGGG AGGCTTTATA CTCCCGCCCC TCGCCCTGGT CGCCACCGAG GGCGGAACCC GCCGCCTGGC CGCCGTCGAC GACGCGGCCG CCGCTCTGGG CCTGCACGTC GGCCAGAAGA CCGCCGACGC CGCGGCCCTG GTTCCGGGCC TGGTCACCGC CGACCATGAC CCCGAGGGCG ACCGCGCGGC GCTGGAGATC CTCTGCGACT GGTGCGTGCG CTTCTCGCCG GCCGTGGCCA TCGACGGGCT GGACGGCCTG TTCCTCGACG TCGAGGGCGT CTCGCACCTG TGGGGCGGGG AGGCGGCGAT GCTCGACGAC CTGCTGGCCC GGCTGGAGCG CTGGGGCGCG CCGACGAGGG GCGCGATCGC CGACACCCCC GGCGCGGCCT GGGCCCTGGC CCGCTACGCG CCAGATCGCA CCATAGCCTC GCCCGGCGGC CAAGGCCCGC TGCTGGCCCC GCTGCCAGTC GCAGCCCTGC GGCTGGACGA GGCGGGCCAG GCCCAGCTGC CGCGCCTGGG CCTGTTCCAT GTCGGCCAAT TGCTGGCCCT GCCGCGCGCC CAGTTGGCCA AGCGCTTCGG CCTGAGCGCG GTGCTACGCA TCGACCAGGC CCTGGGCGCG GCCCGCGAGG CCCTGACCTT CCGGCGTCCC GCCACCCCGT GGTTCGACCG CCTGGCCTTC TTCGAGCCGA TCAGCGCCCT GGAGGACCTG GAGCGGGTGA CGGGCGACAT CTGCGCCCTG CTCTGCGCCC GGCTGGAGGC CGAGGGCCAG GGCGCGCGGC GGTTCGAGCT GGTCTTCCAC CGCCTGGACG GCCGCGACTA TCCGCTGCGC GTGGGCCTGT CGCGTCCCGG CCGCGACGCC GCCCGCGTCG CCAAGCTGCT GAAGCCGAAA CTGGAAACGG TCGATCCCGG CTTCGGGATC GAGGTGGTCA CCCTGTGGGC CGCCGATGTC GAGCCGCTGT CCACCGCGCA AAGAAATCTA GGAGGGGGCA GCCTGGACGC CGACGGCGGG GTCAGCCTGG AGGAGGGCCT GGCGCCGCTG ATCGACCGGC TGGTCAACCG TCTGGGCGAG GACCGGGTCT GGCGGGCTGA TCCCCATGAG AGCCACGTGC CCGAGCGTTC GGTGACGCGC GCCGCCCCGC TGGATCCGGC GCCGGAAGCG GCCTGGGACC CCGAGCGGCC GCGACCGACC CGGCTGCTGC GCCGCCCTGA GGCGATCACG GTCATGGCCC AGTTGCCCGA CGAACCGCCG GCCCATTTCA CCTGGCGAGG CCAGCGCCAT CGCGTGCGTC ATGCCGAGGG ACCCGAGCGG ATCGGCCAGG AGTGGTGGCG CAAGGCCTTC GACGGCGTCG GGCCGAGCAA GATCCGCGAC TATTACCGGG TCGAGGACGA GGCCGGCGGC CGGTTCTGGA TCTACCGCCA GGGCCTCTAC GGCGTGGGCG ACGAACCGAA GTGGTGGTTG CATGGCCTGT TTGGGTAG
|
Protein sequence | MARILSVWCR NWPITTWRRA NPSFGSSAEV KASPHGGGFI LPPLALVATE GGTRRLAAVD DAAAALGLHV GQKTADAAAL VPGLVTADHD PEGDRAALEI LCDWCVRFSP AVAIDGLDGL FLDVEGVSHL WGGEAAMLDD LLARLERWGA PTRGAIADTP GAAWALARYA PDRTIASPGG QGPLLAPLPV AALRLDEAGQ AQLPRLGLFH VGQLLALPRA QLAKRFGLSA VLRIDQALGA AREALTFRRP ATPWFDRLAF FEPISALEDL ERVTGDICAL LCARLEAEGQ GARRFELVFH RLDGRDYPLR VGLSRPGRDA ARVAKLLKPK LETVDPGFGI EVVTLWAADV EPLSTAQRNL GGGSLDADGG VSLEEGLAPL IDRLVNRLGE DRVWRADPHE SHVPERSVTR AAPLDPAPEA AWDPERPRPT RLLRRPEAIT VMAQLPDEPP AHFTWRGQRH RVRHAEGPER IGQEWWRKAF DGVGPSKIRD YYRVEDEAGG RFWIYRQGLY GVGDEPKWWL HGLFG