Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1508 |
Symbol | |
ID | 5898963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1603608 |
End bp | 1604708 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641561995 |
Product | DNA protecting protein DprA |
Protein accession | YP_001683136 |
Protein GI | 167645473 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.300262 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGGG GCCTGTCCGA CAAAGAGCGC CTGGCCTGGC TGAGGCTGGC GCGCACCGAG ACGGTCGGCC CCGTCGCCTT CGACCACCTG TTGACGCGGT TTGGTTCGGC CGAACGCGCC CTGATCGCCC TGCCCGAACT GGCCCGGCGT GCCGGCCGCG CCTCGCCCCT GCGCCCGCCG CCCGAGGGCG AGATCCTGGG CGAACTCGAG ATCGGCGACC GCCTCGGCGC GCGGCTGATC TGCGCCTGCG AACCCGACTA TCCGCCCCGG CTGGCCGCCC TCGACCCGCC GCCGCCCGTG CTGTGGGCCC TGGGCCATGC CGATCTGCTG TCCCAGCCCA GCCTGGCCAT CGTCGGGGCG CGGATCGCCT CGGCCGGCGG CCAGCGCTTC GCCCGCCAAC TGGCCACCGA GCTGGGTCGG CACGGCTATG TGGTGGTCTC GGGGTTAGCG CGCGGCGTCG ACGGCGCGGC CCACGAGGGC GCCCTGGCGA CCGGCACCGT CGCCGTGCTG GGCGGCGGGG TGTGCGACAT CTATCCGCCC GAGCACGCCG CCCTGCACGC CCGCATCGCC GGCGAGGGCG GCTGCATCGT CAGCGAAAGC GCCCCCGACC GCCGCGCCAT CGCCAAGGAC TTCCCGCGCC GCAACCGCAT CATCTCGGGC TTGTCGCTGG GCGTGGTGGT GGTCGAGGCC GAGCTGAAGT CCGGCTCGCT GATCACCGCC CGCCTGGCCG CCGAGCAGGG TCGCGACGTG TTCGCCGTGC CCGGCTCGCC GCTCGACCCG CGCGCCAGGG GTCCCAACGA CCTGATCCGC CAGGGCGCGA TCCTCTGCGA GGGCGTCGAG GACGTGCTGC GCTCGCTGTC GGGCCAGGCC CACCTGCGCG AGCGCGAACG CCCCTACGCG GCCGAGGACG ACGACGCCGA GATCGACCAC GAGGCCCTGC GCGAGGAAGT CGCCCGCCTG CTATCGCCCA CCCCGGTCTC GCGCGACGAC CTGGTCCGCG CCACCCGCGC CCCGACCTCG GCGGTGATGG CGGCCTTGGT GGAACTGGCC CTGGCCGAGC GGGTGGAACT GCTGCCGGGC GGGATGGTGG CGGGGGTTTG A
|
Protein sequence | MTRGLSDKER LAWLRLARTE TVGPVAFDHL LTRFGSAERA LIALPELARR AGRASPLRPP PEGEILGELE IGDRLGARLI CACEPDYPPR LAALDPPPPV LWALGHADLL SQPSLAIVGA RIASAGGQRF ARQLATELGR HGYVVVSGLA RGVDGAAHEG ALATGTVAVL GGGVCDIYPP EHAALHARIA GEGGCIVSES APDRRAIAKD FPRRNRIISG LSLGVVVVEA ELKSGSLITA RLAAEQGRDV FAVPGSPLDP RARGPNDLIR QGAILCEGVE DVLRSLSGQA HLRERERPYA AEDDDAEIDH EALREEVARL LSPTPVSRDD LVRATRAPTS AVMAALVELA LAERVELLPG GMVAGV
|
| |