Gene Information |
Locus tag | Caul_4789 |
Symbol | |
ID | 5902251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5175565 |
End bp | 5177109 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641565309 |
Product | protein of unknown function DUF853 NPT hydrolase putative |
Protein accession | YP_001686407 |
Protein GI | 167648744 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
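The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both assumptions agree with the values in this record. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


# Values taken from the Gene Information table for Caul_4789.
length = gene_length_bp(5175565, 5177109)
print(length, protein_length_aa(length))
```

With the record's coordinates this gives 1545 bp and 514 aa, matching the Gene Length and Protein Length fields.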
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.851867 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTGG TGATCGGCGA AGGCGAAATC CTCATCGGAC TTAGCGAGGG GGGGCTCGGC GAAGGACCCG TGACCCAGCG GCTCGACCGA TCCAATCGGC ACGGCGTGGT GGCCGGCGCG ACGGGCACGG GCAAGACCGT CACCCTGCAG GTGATGGCCC AGGCCTTTTC CGACGCCGGC GTGCCGGTGT TTGCCGCCGA CGTGAAGGGC GACCTGTCGG GCATCGCCGC CATCGGGACG CCCAACGACA AGATGCTGGC CCGCGCGGCG TCCATGGATC TGACCCTGAC TCCGGCCGCG CCGCCCGTCG TGTTCTGGGA CCTGTTCGGC CAGAAGGGCC ATCCGATCCG CGCCACCATC TCGGAGATGG GGCCGGTGCT GCTGTCGCGA CTGCTGGAAC TGAACGACGT GCAGGAGGGC GTGCTGACCG TGGTCTTCCA CGTCGCCGAC AAGGACGGCC TGCTGCTGCT GGACCTGAAG GACCTGCAGG CGGCCCTGAA ATACGTCGCC GACCACGCCG CCGAGATCGG CACCCAGTAC GGCAATGTCT CGCCGGCCAC GGTGGGCGCG ATCCAGCGCA AGCTGTTGAC CCTGCAAAGC CAGGGCGCCG AAAACTTCTT CGGCGAGCCG GCCCTGAAGC TGACCGACAT CATGCGCACC GATGTGGCGG GGCGCGGCTA CGTCAACCTG CTGGCCGCCG ATAAGCTGAT CCAGTCGCCC AAGCTCTATT CGACCTTCCT GCTATGGCTG CTGTCGGAGC TGTTCGAGGA GCTTCCCGAG GTCGGCGACC CCGACAAGCC CAAGCTGGTG TTCTTCTTCG ACGAGGCCCA CCTGCTGTTC AACGACGCGC CCAAGCCGTT GCTGGAGAAG ATCGAGCAAG TCGTGCGGCT GATCCGCTCC AAGGGGGTGG GCATCTATTT CGTCACCCAG AATCCGGCCG ACATTCCCGA CGCGGTGCTC GGCCAACTGG GCGCCCGCGT GCAGCACGCG CTGCGCGCCT ACACCCCCGC CGACCAGAAG GGGTTGAAGG CCGCCGCCCA GTCTTTCCGG GTCAATCCGG CCTTCGACAC CGCCGAGACC ATCCAGGCCC TGGGCGTGGG CGAGGCCCTG ATCTCCACCC TCGACGCCAA GGGCGCGCCC TGCGTGGTGC AAAAGACCCT GATCCGTCCA CCCGCCTCGC GCCTGGGTCC GCTGACGCCC GAGGAGCGCG TCGCCCTGAT CGCCAGAAGC CCGGTCGCCG GGCTCTATGA CCAGACGCTC GATCGCGCCT CCGCCTACGA GATCCTGCAG GGCCGGGCCG CCCAGGCCCA GCAGCAAGCC GACACGGTCG CCGCCGCGGC CGAGGCCCAG CGCCAACAGG CCGCCGCCGA AAAGGTCCGT GAACGTGAAG AGGCGGCGGA GGCCCGCGCC GCGCCGCGAC CGCGCGCCTC CAGCCGTCAG TCCATGGGCG AGGCCTTCGC CACCTCGCTG CTGCGCACGA TCGCCAACCA GGCGGGGCGA GAGATCATGC GCGGCCTGAT GGGCGGCATG AGTCGCCGAA GGTAG
|
Protein sequence | MTLVIGEGEI LIGLSEGGLG EGPVTQRLDR SNRHGVVAGA TGTGKTVTLQ VMAQAFSDAG VPVFAADVKG DLSGIAAIGT PNDKMLARAA SMDLTLTPAA PPVVFWDLFG QKGHPIRATI SEMGPVLLSR LLELNDVQEG VLTVVFHVAD KDGLLLLDLK DLQAALKYVA DHAAEIGTQY GNVSPATVGA IQRKLLTLQS QGAENFFGEP ALKLTDIMRT DVAGRGYVNL LAADKLIQSP KLYSTFLLWL LSELFEELPE VGDPDKPKLV FFFDEAHLLF NDAPKPLLEK IEQVVRLIRS KGVGIYFVTQ NPADIPDAVL GQLGARVQHA LRAYTPADQK GLKAAAQSFR VNPAFDTAET IQALGVGEAL ISTLDAKGAP CVVQKTLIRP PASRLGPLTP EERVALIARS PVAGLYDQTL DRASAYEILQ GRAAQAQQQA DTVAAAAEAQ RQQAAAEKVR EREEAAEARA APRPRASSRQ SMGEAFATSL LRTIANQAGR EIMRGLMGGM SRRR
|
| |
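The relationship between the Gene sequence and Protein sequence fields can be checked with a short script. This is a minimal sketch using only the Python standard library. It assumes the sequence contains only the unambiguous bases A, C, G, T, and it uses the standard codon assignments, which translation table 11 shares with table 1 (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10 and uppercase it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the Gene sequence field above.
prefix = "ATGACCCTGG TGATCGGCGA AGGCGAAATC"
print(translate(prefix))
```

The 30-base prefix translates to MTLVIGEGEI, the first ten residues of the Protein sequence field. Passing the full Gene sequence string to `gc_content` and `translate` in the same way allows the GC content and Protein sequence fields to be compared against the nucleotide sequence.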