Gene Information |
Locus tag | Rcas_2945 |
Symbol | |
ID | 5540435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3819276 |
End bp | 3820496 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640895065 |
Product | cobalamin synthesis protein P47K |
Protein accession | YP_001433024 |
Protein GI | 156742895 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
|
|
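The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
    codons = gene_length // 3             # whole codons in the CDS
    protein_length = codons - 1           # drop the assumed stop codon
    return gene_length, protein_length


# Values copied from the Start bp and End bp fields of this record.
gene_bp, protein_aa = cds_lengths(3819276, 3820496)
print(gene_bp, protein_aa)  # 1221 406
```

Under these assumptions the result agrees with the Gene Length (1221 bp) and Protein Length (406 aa) fields listed above.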
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.879984 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0065134 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGCAAC ATACTGCGCA ACCGTTGCCG GTAACGGTTC TGTCGGGGTT TCTGGGCAGC GGTAAAACGA CCCTGCTCAA CCATGTGCTT GCCAATCGCG AAGGTCTGCG CGTGGCAGTC ATCGTCAACG ACATGAGCGA GGTCAACATC GACGCCCGGC TGGTGCGCAG CGGCGGGGCG GCCCTCAGTC GCACCGAAGA GCGCCTGATC GAGATGACCA ATGGGTGCAT CTGCTGCACG CTGCGCGAGG ATTTGTTGGT TGAGGCGGCG CGCCTGGCGC GTGAAGGGCG CTTCGATTAT CTGCTCATCG AGTCGACCGG CATCTCCGAG CCGCTGCCGG TGGCGGAGAC GTTCACGTTC GCGGATGAAA CCGGCGTCAG CCTGGCGGAA CTGGCGCGGC TGGATACGAT GGTGACGGTC GTTGATGCGT TCAATTTTCC GCAGGATTTG TGCTCGACCG ACGACCTGCG TGATCGGAAC ATGGCTGCCG ACGACGATGA TGAACGGTCG GTTGTTGATT TGTTGATCGA TCAGGTTGAG TTCGCCGATG TTCTGGTGCT GAACAAGATC GATCTGGTCG ATCCCGATGT GGTGGATCAA CTGGAAGCGC TTCTGCGCAA ACTGAACCCC GATGCCCGCA TTGTGCGCGC GTCGTTTGGG CGTGTGCCGC TGCGCGAGAT ATTGAATACC GGTCGCTTCA ATTTTGAGCG CGCGGCGCAG GCGCTTGGCT GGCTTAAGGA ACTTCGCGGC GAACATACGC CGGAAACCGA GGAGTATGGC ATTTCGAGTT TTGTCTATCG CGCTCGACGA CCGTTTCATC CTCAACGTTT CTGGGACCTC ATTCACGATG AGTGGCCCGG TGTGTTGCGT TCTAAGGGGC TGATCTGGCT GGCGACGCGC ATGAGTATCA GCGGTCTCTG GTCGCAGGCC GGGAGTGCGT GTCGGGTCGA GCCAGGCGGC TTGTGGTGGG CGGCGCTGCC GGATGATGAA TTGCCAGATG ATCCTGAAGA TGAAGCGCAT CTGGCGCAGG TATGGCACAG TCGGTGGGGC GATCGGCGGC AGGAACTGGT GCTGATCGGG CAGGATATGG ACGAGGCGGC GCTGCGCGCT CGCCTTGATG CCTGCCTGTT GACCGACGAC GAGATGGCGT TGGGTCCCGA AGGGTGGGCG CAGTTTGACG ATCCTTTCGG GACATGGTCG GTGTGGGTGT CCGAGGATTG A
|
Protein sequence | MAQHTAQPLP VTVLSGFLGS GKTTLLNHVL ANREGLRVAV IVNDMSEVNI DARLVRSGGA ALSRTEERLI EMTNGCICCT LREDLLVEAA RLAREGRFDY LLIESTGISE PLPVAETFTF ADETGVSLAE LARLDTMVTV VDAFNFPQDL CSTDDLRDRN MAADDDDERS VVDLLIDQVE FADVLVLNKI DLVDPDVVDQ LEALLRKLNP DARIVRASFG RVPLREILNT GRFNFERAAQ ALGWLKELRG EHTPETEEYG ISSFVYRARR PFHPQRFWDL IHDEWPGVLR SKGLIWLATR MSISGLWSQA GSACRVEPGG LWWAALPDDE LPDDPEDEAH LAQVWHSRWG DRRQELVLIG QDMDEAALRA RLDACLLTDD EMALGPEGWA QFDDPFGTWS VWVSED
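The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the standard NCBI ordering of bases (T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted, which this sketch does not model. The helper names `clean` and `translate` are hypothetical, and only the first 30 bases and the last 9 bases of the gene sequence are used rather than the full 1221 bp.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that group the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGCGCAAC ATACTGCGCA ACCGTTGCCG"))  # MAQHTAQPLP
# Last nine bases of the gene sequence (two codons followed by the TGA stop).
print(translate("GAGGATTGA"))  # ED
```

The first output matches the first block of the protein sequence (MAQHTAQPLP), and the second matches its final two residues. Because the gene sequence is already given 5' to 3' as the coding strand, no reverse complement is applied here even though the gene is annotated on the minus strand.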