Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3607 |
Symbol | |
ID | 5901062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3890239 |
End bp | 3891528 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564118 |
Product | gluconate 2-dehydrogenase (acceptor) |
Protein accession | YP_001685232 |
Protein GI | 167647569 |
COG category | [C] Energy production and conversion |
COG ID | [COG2010] Cytochrome c, mono- and diheme variants |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.105717 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTGAAAC GTGTCCTCAT CGGATTGCTG GCGGTCGTGG CCGTAGGGCT GGTCGGCTTC GGCGTCTTCG CCTGGCGTCC CGCGATCGGC AAGATCGCGC CGCCCCCGCC ATCGGCCTTC TCGCCGGACC TGGTCGCGCG GGGCGAGGTT CTCGCGGGGG CCGGCTACTG TTCGACCTGC CACACCACCA AGGGCGGCCA GCCGTTCGCC GGCGGCTATC CGATGAAGAC CAGTTTCGGC GTGATCTATT CGACCAACAT CACCCCCGAC GCCAAGACCG GCATCGGAAC CTGGTCGGAG GCCGCGTTCC GCCGGGCGAT GCATCAGGGC GTGGCCCGCA ACGGCGCGCA CCTGTTTCCG GCCTTCCCCT ACGATCATTT CACCAAGCTG TCCGACGCCG ACGTCTCGGC GCTCTACGCC TACATGATGA CCCGCCCGGC GGTGGTCGCC CCGGCCAAGC GCAACGGCAT ACCGTTCCCG CTCAACATCC GCGCTCTGCA GGCGGGCTGG AAGCTGTTGT TCTTCAAGCC TGGACGCTTC GTGCCGGACA GCGGCAAGAG CGCCGAATGG AACCGCGGCG CCTATCTGGC CCAGGGCGTC AGCCACTGCG GCGCCTGCCA CACGCCGCGC GGGGCGCTGG GGGCCGAAAA GCGCGACAAG GCCTTCGCCG GGGCTCCGAT CGACAACTGG ATCGCCCCGC CCCTGACCGC CGCCAATCCC TCGCCGGTCG CCTGGGACCA GGCCGAACTG GTCGCCTATC TGCGCACCGG CGTCAGCCTC TATCACGGCG TCGCCGCCGG CCCGATGGCG CCGGTGGTCC ACGATGGACT CGTCAGATTG CCGGACGCCG ACATCCAGGC CCTGGCGACC TATTTCGTGG CCGTCGACGG GGCGGCGAGC CGGAGCGCTA GCCTGGCGAC GGCCCTGCAA AAGGCGGCGA CGGCCGACCG GCTGAACGTC GGGACCCAGA TCGACCCGGC CGCGCGGCTC TACACCGCCG CCTGCGCCTC CTGCCACTAT AACGGCGGCG GTCAGCCCAA CCCGCTGCGG CCGGACCTGG CGCTCAACAG CGCGGTCAGT CTGGATGACC CGACCAACCT GATCCGGGTG GTGCTCTACG GGGTCAGCGC CAGGGACGGC GCGCCGGGCG TGGTGATGCC CAGCTTCAAC CGGTTCAGCG ACGCCGACGT CGCGACGTTG GCGGCCTATC TGCGCGCCAC CCGCACCGAC AAGCCGGCCT GGCCGAAACT GACCGACAAG GTCGCGGCGA TCCGCGCGCA GGGAAGGTGA
|
Protein sequence | MLKRVLIGLL AVVAVGLVGF GVFAWRPAIG KIAPPPPSAF SPDLVARGEV LAGAGYCSTC HTTKGGQPFA GGYPMKTSFG VIYSTNITPD AKTGIGTWSE AAFRRAMHQG VARNGAHLFP AFPYDHFTKL SDADVSALYA YMMTRPAVVA PAKRNGIPFP LNIRALQAGW KLLFFKPGRF VPDSGKSAEW NRGAYLAQGV SHCGACHTPR GALGAEKRDK AFAGAPIDNW IAPPLTAANP SPVAWDQAEL VAYLRTGVSL YHGVAAGPMA PVVHDGLVRL PDADIQALAT YFVAVDGAAS RSASLATALQ KAATADRLNV GTQIDPAARL YTAACASCHY NGGGQPNPLR PDLALNSAVS LDDPTNLIRV VLYGVSARDG APGVVMPSFN RFSDADVATL AAYLRATRTD KPAWPKLTDK VAAIRAQGR
|
| |