Gene Information |
Locus tag | Caul_1643 |
Symbol | |
ID | 5899098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1724367 |
End bp | 1725602 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641562132 |
Product | nucleotide diphosphatase |
Protein accession | YP_001683270 |
Protein GI | 167645607 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
|
|
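The coordinate and length fields above are related by simple arithmetic. The sketch below is an illustration, not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions agree with the values listed here. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa for an unspliced CDS, assuming one stop codon
    at the end that is not counted in the protein."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1724367, 1725602)
print(length)                  # 1236 (Gene Length)
print(protein_length(length))  # 411  (Protein Length)
```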
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0839592 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTATTG TCTGCCGCTG GATGGCCCTG GTCCTGGCCG TGGTGCTCGG CGCCTGCGCC ACGCCTTCGT TTGCAGCGAC GCCGAAAGCG CCCCTGACCA TCCTGATCTC GCTGGACGGC TTCCGCGCCG ACTATCTGGA CCGTGGCGAC ACCCCGACCT TGTCGGCCCT GGCCGCCGAC GGCGCGCGCG GGGCCATGCG GCCGTCGTTC CCGTCCCTGA CCTATCCGAA CCACTACACC CTGGTGACCG GCAAGCGGCC CGACCATCAC GGCATGGTCA ACAACACCCT GGAGGACGCC GACATCGGCG TCGCCTTCAG CATGTCGAAC AAGGAAGCGG TCGGCGATGG ACGCTGGTGG GACCAGGCCA AGCCGATCTG GGTCAGCGCG GAGCAGCAGG GCGTGCACGC CGGAACCCTG TTCTGGCCGG GCTCGGAAGC GCAGATCGAC GGGGTGCGCC CCAGCCGCTG GCAGGTGTTC GACATGAAGA TCCCGTCCAA CGACCGGGTC GACACCCTGC TGTCGTGGAT CGACGCGAAG GATCCGCCGC TCGGCTTCGC CACCCTCTAT TTCGACCGCA CCGACACCGA GGGCCACCAC TATGGCCCGG ACTCGCCGGA GGTGAACGCC GCCGCCGCCG AGGTCGATGC GGCCATCGCT CGGCTGGTGG CGGGGCTGAA GGCGAGGGGC CTGTTTGACA CCACCAACAT CGTGGTCGTC GCCGACCATG GCATGGCGCC CCAGCCGCTG AGCGGGCTGG TCGATGTGGC GACCTTGATC GACCCCGCCA AGGTCAAGTT CGTCAGCACG GGTTCGATCG TCGCCGTCCG CGCCGTGCCC GGCTTCGAGG CCGAGGTCGC GGCCACGATG CTCAAGGCCC ACCCGCACCT GACCTGCTGG GAAAAGGCCA GGATCTCGGC CCGGTACGGG TATGGGACCA ACCCGCGCGT GCCGCCGATT GTCTGCCTGG CCGAGCGCGG CTGGTACTTC GTCACGGCGT CCGCGCTCAA GAAGCGCCTG GAGGAGCATC CGCGCGACGG TGGGGCGCAC GGCTACGACC CCAGTGACCC GACCATGCGG GCGGTGTTCG TCGCCCATGG TCCTGCGTTC AAGCGTGGCG TGGTGCTACC GGTGTTCGAC AATGTCGACG TCTATCCGCT GCTGACCCGG CTGATCGGCG TGAAGGGCGA CAAGGGCGAC GGCGCGCTGG GGCCGGTGAA GGCGGCGCTG CGCTAG
|
Protein sequence | MVIVCRWMAL VLAVVLGACA TPSFAATPKA PLTILISLDG FRADYLDRGD TPTLSALAAD GARGAMRPSF PSLTYPNHYT LVTGKRPDHH GMVNNTLEDA DIGVAFSMSN KEAVGDGRWW DQAKPIWVSA EQQGVHAGTL FWPGSEAQID GVRPSRWQVF DMKIPSNDRV DTLLSWIDAK DPPLGFATLY FDRTDTEGHH YGPDSPEVNA AAAEVDAAIA RLVAGLKARG LFDTTNIVVV ADHGMAPQPL SGLVDVATLI DPAKVKFVST GSIVAVRAVP GFEAEVAATM LKAHPHLTCW EKARISARYG YGTNPRVPPI VCLAERGWYF VTASALKKRL EEHPRDGGAH GYDPSDPTMR AVFVAHGPAF KRGVVLPVFD NVDVYPLLTR LIGVKGDKGD GALGPVKAAL R
|
| |
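The GC content and protein sequence can be derived from the gene sequence. The following is a minimal sketch under stated assumptions, not the method the database used. It assumes the sequence is pasted as shown, in space-separated blocks of ten, and that it is read in frame from the first base. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which start codons it allows. This sketch does not handle alternative start codons, which does not matter here because the gene begins with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
prefix = "ATGGTTATTG TCTGCCGCTG GATGGCCCTG"
print(translate(prefix))  # MVIVCRWMAL, the first ten residues of the protein
```

Passing the full gene sequence to `gc_fraction` and `translate` should give values comparable to the GC content and protein sequence fields in this record.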