Gene Information |
Locus tag | Caul_4368 |
Symbol | |
ID | 5901829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4744378 |
End bp | 4745430 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641564886 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_001685986 |
Protein GI | 167648323 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
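
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which the reported 1053 bp implies) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(4744378, 4745430)
print(length_bp, protein_length(length_bp))
```

With the values from this record the sketch prints `1053 350`, which agrees with the Gene Length and Protein Length rows.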
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.468133 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACA TCCCCGCCCT CCGCGCCGCC CTGCTGGCCT GGTACGACGC CCAGGCGCGG GACCTGCCCT GGCGGACCGG GCCGGCGGCC GGCAAGGCGG GACAGCGGTC CGACCCTTAC CGCGTCTGGC TGTCGGAGGT GATGCTGCAG CAGACCACCG TGCCGCACGC CACGCCCTAT TTCCTGAGTT TCACCCAGCG CTGGCCGACG GTCTCAAGCC TGGCGGCGGT GGCGGACGAC GACCTGATGG CCGCCTGGGC GGGCCTGGGC TACTACGCCC GCGCCCGCAA CCTTCTGGCC TGCGCCCGGG CCGTGGCGGC TGAGCACGGC GGGGTGTTTC CCGACACCGA GGCGGCCCTG CGCGCCCTGC CGGGCGTCGG CGCCTACACC GCCGCCGCCG TTGCGGCCAT CGCCTTCGAC CGCGAGGCCA ACGTGGTCGA CGGCAATGTC GAGCGGGTGA TGGCGCGGCT GTTCGCGGTG GAAGACCCCG TGCCCGACGC CAAGCCGGAG CTGAAGCGCC TGGCCGGCGA GCTGGTCACC GCCGCGCGTC CCGGCGACTG GGCCCAGGCG CTGATGGACC TGGGCGCGAC GGTGTGCCGG CCCAAGGGTC CGCTGTGCGA CCGCTGCCCG GTCTCGGCCT GGTGCGAGGG CTTCAAGACC GGCGCGCCGG AGACCTATCC GCGCAAGACG AAGAAGGCCG AACGGCCTCG CCGCTACGGG GTGGCCTATG TCCTGACGCG GGGCGAGGCC ACGGCCCTGG TCCGCCGCCC GCCCAAGGGC CTGCTGGGCG GGATGCTGGG CCTGCCGACC AGCGACTGGC GCGATCGTCC GTGGACGGAT TTCGAAGCCG CCGCCACCGC GCCGGCGGCC GGCGCCTGGC GCGACTTCGG CGCGGTCGAG CACGTCTTCA CCCACTTCTC GCTCACGCTG CGAGTGCTGC GGGCCGAGAG CAACGGCGAG GGCGACTTCG TCTGGACCGA TCCAGCGGGG CTGGCCGCGC TGCCCAGCGT ATTTCTGAAG GCCGCGAAGG CGGGGCGGGC GCGACTGGTC TAA
Protein sequence | MLDIPALRAA LLAWYDAQAR DLPWRTGPAA GKAGQRSDPY RVWLSEVMLQ QTTVPHATPY FLSFTQRWPT VSSLAAVADD DLMAAWAGLG YYARARNLLA CARAVAAEHG GVFPDTEAAL RALPGVGAYT AAAVAAIAFD REANVVDGNV ERVMARLFAV EDPVPDAKPE LKRLAGELVT AARPGDWAQA LMDLGATVCR PKGPLCDRCP VSAWCEGFKT GAPETYPRKT KKAERPRRYG VAYVLTRGEA TALVRRPPKG LLGGMLGLPT SDWRDRPWTD FEAAATAPAA GAWRDFGAVE HVFTHFSLTL RVLRAESNGE GDFVWTDPAG LAALPSVFLK AAKAGRARLV
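
Both sequence fields are printed in space-separated blocks of ten characters, so the spaces have to be stripped before the sequences are used. The sketch below uses only the Python standard library and shows one way to compute GC content and translate the gene sequence for comparison with the fields in this record. It rests on two assumptions: the gene sequence is given in coding orientation (it starts with ATG and ends with TAA, even though the gene lies on the minus strand), and the codon-to-amino-acid assignments of translation table 11 match the standard code, from which table 11 differs only in its permitted start codons. All names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino acid assignments
# shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-of-ten spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):  # ignore a trailing partial codon
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence from this record.
print(translate("ATGCTCGACA TCCCCGCCCT"))
```

For the first two blocks of the gene sequence this prints `MLDIPA`, matching the start of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow a comparison against the reported GC content and the full protein sequence.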