Gene Information |
Locus tag | Caul_5443 |
Symbol | |
ID | 5897138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010333 |
Strand | + |
Start bp | 156092 |
End bp | 157720 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641550730 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_001672216 |
Protein GI | 167621708 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
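The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(156092, 157720)
print(length, protein_length_aa(length))  # prints "1629 542"
```

Both results agree with the Gene Length (1629 bp) and Protein Length (542 aa) fields in the record.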
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.078996 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGACT ATATTATTGT TGGAGCGGGG TCTGCCGGAT GCTTGTTGGC GGAGCGCTTG TCAGCCAATC CCAGGACGCG GGTCTGTCTG CTTGAGGCGG GCCCGCCCGA CCGCAGCCCG CTGATCCACA TGCCCATTGG GATAGCGCTT CTGTCAAAGA GCAAAATTCT CAATTGGGCA TTCGAGACGC AGCCACAGGC CAATCTCGAT GGTCGACGGC TGTTTTGGCC GCGCGGCAAA ACCCTTGGCG GATCGAGTTC GATCAATGCG ATGGTCTATA TCCGCGGGCA CCGGGATGAC TATGACTCCT GGGGCGAGGC AGCCGATCCG ATCTGGTCCT ATGACAATGT GCTCCCGCTG TTCAAGGCGA TGGAGTCCAA CGAGAGATTT GGAACCGACG CGTTTCATGG CGGCGATGGT GAGCTTCACG TCAGCGACCT GCGAACCCGC AACCCCTTGA GCGATGCCTT CGTCGAGGCC GGACAACAGG CCCAGTTTCC GCATGCCGTC GATTTCAATG GGAAGATGCA GGACGGCGTC GGCCTGTACC AGGTCACCCA GCACAAAGGC CGGCGCTGGA GTTCCGCGCG CGCCTTTCTT TCCAAGGCCA AGGGCCGGCC CAATCTACGG ATAGTCACGG GCGCGCGGGC TACCCGGATC ATTCTGGAGG GCCGCAAAGC GGTCGGCGTG ACCTATGCCG CAGGCGGCAA GCTGGTCGAT GTGCGAACCA GGGGCGGCGA GGTCATTCTT TCGGGCGGCG CCGTCAATTC CCCGCAACTG CTGCTGCTTT CCGGCATCGG CGGCGCGGCC GAGCTGAACG CACTCGGCAT TCCGGTGGTC GTCGACCTTC CGGCAGTTGG AAAAAATCTG CAGGATCACC TCGATATCAC AATCATGCAT GAGGCGAACG ATCGTACACC GATCGGCATC GCACCGTCAT TCATCCCGCG GGCGCTGTCC GGAGCGCTAT CCTACGCCTT CCTTCGAAAG GGTTTCTTGA CGAGCAACGT CGCCGAGGCG GGCGGCTTCG TCAAAAGCAC ACCTTCGCGG AGTCGGCCGA ATCTACAGTT TCATTTCCTC CCCACGCTTT TGAAGGACCA TGGGCGCGAA ATGGCGTTCG GGTATGGCTA TACATTGCAT GTCTGCGATC TTCTGCCCAA GAGCCGAGGC CGCATCGGGC TCACAAGCCC CGACCCGCTC GACGATCCGC TGATCGATCC AAACTATCTC TCGGCCCCCG AAGACATTGA GACCATGGTC GCGGCGGTGA AGATCGGCCG GCAAATTCTG TCGGCGCCGT CAATGGCGGC CTTCTCGAAA ACCGAACTGG TCCCTGGGCC ATCGGTCCAG AGCAAGGCGG ATATCATGGC GGATATCCGT CGGCGAGCGG AGACGATCTA TCATCCGGTG GGAACATGCC GGATGGGACG AGACCCTCAG TCGGTTGTCG ATCCGTCACT CCGAGTGCGT GGCGTGCAAG GCCTTCGCGT CGTCGACGCC TCGGTCATGC CGACGCTGGT CGCCGGAAAC ACCAACGCCC CGACGATGAT GATTGCGGAA AGAGCTGCCG AGCTCATTCT TGGGAAGACG AAACTCGCAC TCAGCGCCAA CATTGAGGCA TTCCGCTAA
|
Protein sequence | MFDYIIVGAG SAGCLLAERL SANPRTRVCL LEAGPPDRSP LIHMPIGIAL LSKSKILNWA FETQPQANLD GRRLFWPRGK TLGGSSSINA MVYIRGHRDD YDSWGEAADP IWSYDNVLPL FKAMESNERF GTDAFHGGDG ELHVSDLRTR NPLSDAFVEA GQQAQFPHAV DFNGKMQDGV GLYQVTQHKG RRWSSARAFL SKAKGRPNLR IVTGARATRI ILEGRKAVGV TYAAGGKLVD VRTRGGEVIL SGGAVNSPQL LLLSGIGGAA ELNALGIPVV VDLPAVGKNL QDHLDITIMH EANDRTPIGI APSFIPRALS GALSYAFLRK GFLTSNVAEA GGFVKSTPSR SRPNLQFHFL PTLLKDHGRE MAFGYGYTLH VCDLLPKSRG RIGLTSPDPL DDPLIDPNYL SAPEDIETMV AAVKIGRQIL SAPSMAAFSK TELVPGPSVQ SKADIMADIR RRAETIYHPV GTCRMGRDPQ SVVDPSLRVR GVQGLRVVDA SVMPTLVAGN TNAPTMMIAE RAAELILGKT KLALSANIEA FR
|
| |
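The GC content field can in principle be recomputed from the gene sequence as printed in the record, which is split into space-separated blocks of 10 bases. The sketch below assumes that GC content means the fraction of G and C bases over the full sequence; the helper names are hypothetical, and the rounding used by the database is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Only the first two blocks of the gene sequence are used here for brevity;
# paste the complete sequence to compare against the record's GC content field.
first_blocks = "ATGTTTGACT ATATTATTGT"
print(f"{gc_fraction(first_blocks):.2f}")
```

The same `clean_sequence` helper also applies to the protein sequence, which is printed in blocks of 10 residues.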