Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2585 |
Symbol | cobD |
ID | 4077496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2722447 |
End bp | 2723358 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638007909 |
Product | cobalamin biosynthesis protein |
Protein accession | YP_614579 |
Protein GI | 99082425 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1270] Cobalamin biosynthesis protein CobD/CbiB |
TIGRFAM ID | [TIGR00380] cobalamin biosynthesis protein CobD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.650377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000040763 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCACCG CCGCCCTGTT GATCCCCGCC ATGATCCTCG ACGCCGCATT TGGCGAGCCC AAATGGCTCT GGTCGCGGCT GCCGCATCCC GCTGTGCTGA TGGGCAAGCT GGTGCAAGCG CTGGATGATC GGCTGAACGA AGGTGACGGG CGCCAGGTCA AAGGTGTTAT CGCGGTCGCG GTGCTGGTAT TCGTCGGTTT GCTCCTCGGC TGGATCCTGT CCTGGTTTGG CAGCCTCGTA AGCGTTCTGA TCGCCGCCAT CCTGATCGCT CAACGGTCTC TTATTGACCA TGTGCGCGCA GTCGCCACCG GGCTTCAGAA TGACCTCGAC GCGGGGCGCT CTGCGGTCGC GATGATCGTC AGCCGCGACA CCGCGACCAT GACCGGCCCG CAGATCGCGC GCTCCGCCAT CGAGAGCGGT GCGGAGAATT TCTCCGACGG GGTTATCGCG CCTGCATTCT GGTTTCTGGT CGCGGGCCTG CCGGGGCTGC TGATCTACAA GCTCATCAAC ACCGCCGACA GCATGATTGG CTATCGGACC GAGGCGTACG AGGACTTCGG CTGGGCAGCG GCGCGGCTCG ACGATGTGCT CAACATCCTG CCGGCGCGCC TCAGCGCACT CCTGATCGCG CTTGTGACCG GCCGCGCAGG TGATTGGGGC GAGATCAGCG CGGATGCACG CAAGCATCGC TCTCCCAATG CAGGCTGGCC CGAGGCAGCG ATGGCGCGCG CACTTGGTGT CGCTCTGGCC GGGCCGCGCT CTTATGATGG CGAAATGCGT CCCTTTGCCT GGGTAAACGC GAGCGGATCA AAAAGCGCCA GCGCTCATAG CATCACCCGC TGTTGCGAGG TGCTTTGGAA ATCCTGGGGT CTCGCGCTGG TTCTGGTGGT TGCATTGGGT CTGCTCTTCT AG
|
Protein sequence | MSTAALLIPA MILDAAFGEP KWLWSRLPHP AVLMGKLVQA LDDRLNEGDG RQVKGVIAVA VLVFVGLLLG WILSWFGSLV SVLIAAILIA QRSLIDHVRA VATGLQNDLD AGRSAVAMIV SRDTATMTGP QIARSAIESG AENFSDGVIA PAFWFLVAGL PGLLIYKLIN TADSMIGYRT EAYEDFGWAA ARLDDVLNIL PARLSALLIA LVTGRAGDWG EISADARKHR SPNAGWPEAA MARALGVALA GPRSYDGEMR PFAWVNASGS KSASAHSITR CCEVLWKSWG LALVLVVALG LLF
|
| |