Gene Information |
Locus tag | Caul_0050 |
Symbol | |
ID | 5897762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 62648 |
End bp | 64834 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641560533 |
Product | aldehyde oxidase and xanthine dehydrogenase molybdopterin binding |
Protein accession | YP_001681686 |
Protein GI | 167644023 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
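The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes one terminal stop codon; the record does not state either convention, but the numbers are consistent with both. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 62648
END_BP = 64834
GENE_LENGTH_BP = 2187
PROTEIN_LENGTH_AA = 728


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2187
print(expected_protein_length(GENE_LENGTH_BP))  # 728
```

Both results agree with the listed Gene Length (2187 bp) and Protein Length (728 aa).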
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.151255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.855639 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCGC TCTTTGGCAA GTCGTCGCCC CAGGACGGGG TCAGCCGCCG CGACCTGGTG GTCGGCGCGA CTCTGGTCGG CGGCGCCCTG CTGGTCGGCT GTTCGCCCGC CGCCCTGATG AGCGCCGGCT CCAAGGTCGA TGTCGGGGCC TTTGGACCGT TCATCCGCTT CGACCCCGAC GGGGCGGTCA CGGTGCTGTC CAAGCACATC GAATTTGGCC AGGGCAACCA CGCCGGCCTC GCCGCCATCG TCGCCGAGGA GCTGGACGCC GACTGGAGCC AGGTGAAGGT CGTGCACGCT CCGGCCAACG CCAAGCTTTA TGGCAACAAG GGCGCGGGCA TCCAGCTGAC GGGCGGCTCG TCGGCGATCT CCAATTCCTG GGAACAGCTG CGCCAGGCTG GAGCCGGGGC GCGGGCGATG TTCGTGCAGG CGGCGGCGAC GAAGTGGAAC GTTCCCGCCA GCCAGATCAC CGTCAAGGAC AGCGTCGTCA GCGGCGGCGG CAAGAGCGCG GGTTTCGGCG AGCTGATCGC CGACGCCGCC AAGGTCACGC CGCCCGAGAC CCCAACGCTG AAGGACCCCA AGACCTTCAC CCTGATCGGC ACCGACCGGG TGCGCCGCAA GGACAGCCTG ATCAAGAGCA ACGGCACGGC CCGCTATACC CAGGACGTCC AATTGCCCGA CATGCTGGTG GCCATGGTCG CCCATGCGCC GCGCTTCGGG GCGAGCGTGA AGAGCTTCAA TGGCGACGAC GCCAAGAAGG TGGCGGGCGT GGTCGAGGTC TATCAGATCC CGACCGGGAT CGCGGTGGTC GCCGACAGCA CCTACGCCGC CCGCATGGGT CGCGAGGCGC TGAAGGTCGA GTGGGACGAG GGCAAGGTCG ATACCCGAAG CTCGGCGACG ATCGCCGGAC AGTGGCGCGA CATCGCCGCC GGCAAGGGTC CGTCGGACCT GAAGTGGGAG GCCTTCGACT CCAAGGGCGA CGCGGCCGCG GCCTCGGCGG GAAAGGGCGC CTCGGTCTTC GAGACCACCT ATGACTTCCC CTACCTGGCC CACGCCACCA TGGAGCCGAT GAACTGCGTG GCGGTGGTCG ATGGCGGCAA TGTCAAGCTG ATCCATGGCT CTCAGGCCCA GACCCTGGAC CAGATGGCCG CCGCCAAGAT CGTCAAGACC CTGCCCGGCT CGGTCGAGAT CGAGACCCTG TTCGCCGGCG GCTCGTTCGG CCGACGGGCC AATTTCCAGT CCGACTATGT CGCCGAGTGC GTGCACATCG CCCAGAAGGT CGGCGGCGGC CGTCCGGTGA AACTGATCTG GACCCGCGAG GACGACATGC GGGCCGGCTA TTTCCGTCCG CTGGTCCATC ACGCCGTGCG GGTGACGCTC GACAAGGACG GCTATCCGGC CACCTGGCGC CACCGGATCG TCAGTCAGTC GATCATGAAG GGCTCGCCCA TGCCCGCCAA GGGTCCCGAC CAGACCGCGA TCGAGGGGAC GGCCGGCTCG CCCTATCTGA AGGCCACCCC GATCGTCGAT GCGCAGCTAG CCTTGCCGGA AGCCGGCGTG CCGGTGCTGT GGTGGCGCTC GGTGGGGGCG ACCCATACCG CCTTCGTCAT GGAGCACACC ATCGACCAGC TGGCCCGCAA GGCTGGCAAG GCTCCGATCG ACTACCGTCG CGCCCTTTAC GCCAAGGCAG GGGCCGACCG GCACCTGGCG GCGTTGAACC TGGCGGTCGA GAAGGCCGGC CCGGCCCCGG CCGCCGGCTG GACGCGGGGG GTGGCGGTGC ACGAGAGCTT CGGTTCGGTG 
GTCGCCCAGG TCGCCGAGGT CAAGCTGGTG GACGGCCAGC CGCGCGTCGG TCGGGTGGTC ACCGCCATCG ACTGCGGCGT CGCGGTCTCG CCCGATCAGA TCGCCGCCCA GATGGAAGGT GGAACCTGCT ACGGCCTGTC GGCGGCGCTG TACGGCGAGA TCACCCTGAC CGACGGGGCG GTCGACCAGA GCAATTTCGA CACCTATCGC GTGCTGCGAA TGAACGAGGC GCCGACGGTC GAGACGCACA TCGTGCCGTC GGGAAATCCG CCCAGCGGAG TGGGGGAACC TGGAACGCCG GTGATCGGAC CGGCCGTGGC CAACGCCCTG CTGGCGATCT CCAACACGCC GACCACCCGC CTGCCCTTGG TCCGCGCGAT GGCTTGA
|
Protein sequence | MNALFGKSSP QDGVSRRDLV VGATLVGGAL LVGCSPAALM SAGSKVDVGA FGPFIRFDPD GAVTVLSKHI EFGQGNHAGL AAIVAEELDA DWSQVKVVHA PANAKLYGNK GAGIQLTGGS SAISNSWEQL RQAGAGARAM FVQAAATKWN VPASQITVKD SVVSGGGKSA GFGELIADAA KVTPPETPTL KDPKTFTLIG TDRVRRKDSL IKSNGTARYT QDVQLPDMLV AMVAHAPRFG ASVKSFNGDD AKKVAGVVEV YQIPTGIAVV ADSTYAARMG REALKVEWDE GKVDTRSSAT IAGQWRDIAA GKGPSDLKWE AFDSKGDAAA ASAGKGASVF ETTYDFPYLA HATMEPMNCV AVVDGGNVKL IHGSQAQTLD QMAAAKIVKT LPGSVEIETL FAGGSFGRRA NFQSDYVAEC VHIAQKVGGG RPVKLIWTRE DDMRAGYFRP LVHHAVRVTL DKDGYPATWR HRIVSQSIMK GSPMPAKGPD QTAIEGTAGS PYLKATPIVD AQLALPEAGV PVLWWRSVGA THTAFVMEHT IDQLARKAGK APIDYRRALY AKAGADRHLA ALNLAVEKAG PAPAAGWTRG VAVHESFGSV VAQVAEVKLV DGQPRVGRVV TAIDCGVAVS PDQIAAQMEG GTCYGLSAAL YGEITLTDGA VDQSNFDTYR VLRMNEAPTV ETHIVPSGNP PSGVGEPGTP VIGPAVANAL LAISNTPTTR LPLVRAMA
|
| |
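The sequences above are displayed in space-separated blocks of 10, which must be removed before any analysis. The sketch below joins the first three blocks of the protein sequence and locates the first arginine pair near the N-terminus, which is where a twin-arginine (Tat) signal would be expected given the TIGR01409 assignment. This is only a string search: an "RR" pair alone is not evidence of Tat export, and the `window` parameter and function names are assumptions made for illustration.

```python
# First three 10-residue blocks of the protein sequence, copied from this record.
BLOCKED_PREFIX = "MNALFGKSSP QDGVSRRDLV VGATLVGGAL"


def unblock(seq: str) -> str:
    """Remove the whitespace that separates the display blocks."""
    return "".join(seq.split())


def first_twin_arginine(protein: str, window: int = 40) -> int:
    """Return the 1-based position of the first 'RR' pair within the
    N-terminal window, or -1 if there is none."""
    idx = protein[:window].find("RR")
    return idx + 1 if idx >= 0 else -1


prefix = unblock(BLOCKED_PREFIX)
print(len(prefix))                  # 30
print(first_twin_arginine(prefix))  # 16
```

The same `unblock` step applies to the gene sequence, which uses the same block layout.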