Gene Information |
Locus tag | Caul_1417 |
Symbol | |
ID | 5898872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1505788 |
End bp | 1507971 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641561904 |
Product | aldehyde oxidase and xanthine dehydrogenase molybdopterin binding |
Protein accession | YP_001683045 |
Protein GI | 167645382 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
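The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates (the listed Gene Length implies this) and that the final codon is a stop codon not counted in Protein Length. The function names are illustrative and do not belong to any existing tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1505788, 1507971)
print(length)                  # 2184
print(protein_length(length))  # 727
```

Both values match the Gene Length and Protein Length rows of this record.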
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCGA TCGTTCTCGA CAACCCCTCG CGTCGCCTGT TCCTGAAGAC CGGCGCGGCG GCCGGCGGCG GGCTGCTGCT GTCGTTCAAC CTGCCGGGCC TGGTCCGGGC CGAGGGCGAG GCCGCGGCCA ACCCGTTCAA CGCTTTCGTC CGCATCGCGC CGGACGGCGT CGTGACCATC GCCGCCAAGC GGCCGGAGAT CGGGCAGGGG ATCAAGACCT CGATGCCGAT GCTGATCGCC GAGGAGCTGG ACGTCGACTG GTCCAGCGTT CGCATCGAGC AGGCCCCGAT CGACGCCAAG GTGTTCGGCG AACAGTCGGC CGGCGGCAGC ACCTCCACGC CCGACGACTG GGACCCGATG CGCCAGGTCG GGGCGGTGGG GCGAGCCTTG CTGATCCAGG CGGCCGCTCA GCGCTGGAAC ATTCCCGCCG CCGGTCTGGC GACCGAACCC GGCTTCGTGA TCGACAAGGC CGCCAAGCGC CGCGCCTCCT ACGGTGAGCT GGCCCAGGCC GCCGCCAGCC TGGCCGCGCC CGATCCCAAG ACCCTGACGC TGAAGGATCC CAAGACCTAC CGCATCATCG GCAAGCCCCA GCCGCAGTCG GATACGCCGG CGATCGTCAC CGGCCAGCCG CTCTACGGGA TCGATGTGCG CGTGCCGGGC ATGCTGTACG CCACCTTCCT CAAGGCCCCG GTGTTCGCCG CCAAGGTGGC CAAGGTCGAT CTGGCGCCGG CCAAGGCGGT CAAGGGCGTG CGTCACGCCT TCGTGGTCGA TGGCGGGACC GAACTGGAGG GCCTGCTGGG CGGCGTCGTC GTGGTCGCCG ACACCTGGTG GGCGGCGCGC AAGGGCCGCG ACGCGCTGGA GGTGACCTGG GCCGATCACC CGACCAGCGC CCAGAGCAGC GCCGGCTTCG TCGCCAAGGC CGCCGAGTTG CACGGCCAGC CGCCGCACCG CAAGGTCAAG GCGGTCGGCG ACGTCGACGC CGCGCTGCGG GGCGCGGCCA AGCTCGTCCG GGCGGACTAC AGCTACCCGT TCAACGCCCA CGCCACGCTC GAGCCGCAGA ACTGCACGGC CAGCTTCAAG GACGGCAAGG TCGAGATCTG GGCCCCGGCC CAGAACCCCG AGAACGGCCG CGAGCTGGTG GCCAAGACCC TGGGCGTCAA GCCCGACGAC ATCACCATCC ACTTCACCCG CAGCGGTGGC GGCTTCGGCC GGCGGCTGAT GAACGACTTC ATGGTCGAGG CGGCGTGGAT CTCCAAGGTG GTCGGGGCGC CGGTCAAGCT GCTGTGGACC CGCGAGGACG ACATGCAGCA CGACTTCTAC CGGCCGGCCG GCTTCCACCG GCTGACGGCG GGGCTGGACG CCCAGGGCGG GCTGACCGCC TGGCGCAACC ACTTCGTGAC CTTCGGCGAG GGCGACAAGT TCGTTCGAGC CGCCGGCATG AACGCCACCC AATTCCCGTT CGGGGCGGTC GATAACTATG CCCTGGACGT CTCGGTCATG CCGCTGGGCG TGCCGACTGG CTGGCTGCGG GCTCCGGGCA ACAACGCCTA TGGCTTCGTG CTGCAGGGCT TCACCGACGA GGTGGCCCAT GCGGCGGGCG CCGATCCGGT GGCGTTCCGT CAAAAGCTGC TGGGCGAGCC GCGGCTGATC GGCGAGCCGG GCAAGGGCGA CAGCTTCCAC ACGGGCCGCA TGCGCGCCGT GCTGGACCTC GTCGCCGACA AATCCGGCTG GGGCCGCAAG ACCCCCAAGG GCGTCGGCCT GGGCGTGGCC TGCCACTACA GCCACCGGGG CTATGTGGCG 
GTGGTGATGG AGGTCGCGGT CGCCGACGGC AAGCCCCGGG TCCGCAAGGC CTGGGCGGCG GTCGATGTCG GCCGGCAGAT CGTCAATCCG AACGGCGCCG AGCAGCAGGT GCAGGGCTCG GTCCTGGATG GGATCAGCCT GGCGCTGGGG CAGGAGATCA CCATCGAGAA CGGGGCTGTG ACCCAGAGTA ATTTCGGCGA CTTTCCGCTG CTGCGCTTCG CCGACGCGCC GGACGTCGAG GTGCATTTCG TCACCAGCGA CAACCCTCCC ACCGGCCTTG GCGAGCCCGC CCTGCCGCCG TCGCCGGCAG CCTTGGTCAA CGCCATCTTC GCGGCGACCG GCAAACGGAT ACGCGCCCTG CCGATCGGCG ACCAACTGGC CTGA
|
Protein sequence | MTAIVLDNPS RRLFLKTGAA AGGGLLLSFN LPGLVRAEGE AAANPFNAFV RIAPDGVVTI AAKRPEIGQG IKTSMPMLIA EELDVDWSSV RIEQAPIDAK VFGEQSAGGS TSTPDDWDPM RQVGAVGRAL LIQAAAQRWN IPAAGLATEP GFVIDKAAKR RASYGELAQA AASLAAPDPK TLTLKDPKTY RIIGKPQPQS DTPAIVTGQP LYGIDVRVPG MLYATFLKAP VFAAKVAKVD LAPAKAVKGV RHAFVVDGGT ELEGLLGGVV VVADTWWAAR KGRDALEVTW ADHPTSAQSS AGFVAKAAEL HGQPPHRKVK AVGDVDAALR GAAKLVRADY SYPFNAHATL EPQNCTASFK DGKVEIWAPA QNPENGRELV AKTLGVKPDD ITIHFTRSGG GFGRRLMNDF MVEAAWISKV VGAPVKLLWT REDDMQHDFY RPAGFHRLTA GLDAQGGLTA WRNHFVTFGE GDKFVRAAGM NATQFPFGAV DNYALDVSVM PLGVPTGWLR APGNNAYGFV LQGFTDEVAH AAGADPVAFR QKLLGEPRLI GEPGKGDSFH TGRMRAVLDL VADKSGWGRK TPKGVGLGVA CHYSHRGYVA VVMEVAVADG KPRVRKAWAA VDVGRQIVNP NGAEQQVQGS VLDGISLALG QEITIENGAV TQSNFGDFPL LRFADAPDVE VHFVTSDNPP TGLGEPALPP SPAALVNAIF AATGKRIRAL PIGDQLA
|
| |
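The sequences above are printed in space-separated blocks of 10 characters. The following minimal sketch shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the tables differ in permitted start codons). The helper names are hypothetical, and only the first three blocks of the Gene sequence row are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the display format."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the Gene sequence row above.
prefix = "ATGACCGCGA TCGTTCTCGA CAACCCCTCG"
print(translate(prefix))             # MTAIVLDNPS
print(f"{gc_fraction(prefix):.2f}")  # 0.60
```

The translated prefix matches the first block of the Protein sequence row. Running `gc_fraction` on the full gene sequence would be the way to compare against the GC content field; that value is not recomputed here.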