Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1638 |
Symbol | |
ID | 5899093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1718899 |
End bp | 1719915 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641562127 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001683265 |
Protein GI | 167645602 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.145818 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0211449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCGAAA GAAACTGGAA CGAGCTGATC CGTCCTGAGA AGCCGCAGAT CGAGACTGGC GCCGACGCGA CCCGCAAGGC TCGCATCGTG GCCGAGCCGC TCGAACGCGG TTTCGGTGTG ACGCTCGGCA ACGCTCTCCG CCGTGTCCTG CTCTCGTCGC TTCAGGGCGC CGCCGTCACC GCCATCCAGA TCGATGGCGT GGTGCACGAA TTCTCCTCGC TCGAGGGCGT CCGCGAAGAC GTCGTCGACA TCGTTCTGAA CATCAAGCAA CTGGCCGTGC GCATGCACGC CGAAGGCCCC AAGCGCATGA CCCTGCGCGC CACGGGTCCC GGCGTCGTGA CCGCGGGTCA GATCGAAACG CCTTCGGACA TCGAGATCCT GAACCCCGAC CACGTGCTCT GCACGCTGGA CGACGGCGCT TCGGTGCGCA TGGAGTTCAC GGTCAACACC GGCAAGGGCT ACGTCCCTGC CGACAAGAAC CGTCCGGAAG ACGCGCCGAT CGGCCTGATC GCCGTCGACG CCCTCTACAG CCCGGTCAAG CGCGTGGCTT ACCGCGTCGA GCCGACCCGT CAGGGTCAAT CGCTCGACTA TGACAAGCTG ATCCTGGAAG TCGAAACCAA CGGCGCCGTC ACGCCGGTGG ACGCGGTGGC CTACGCCGCC CGCATCCTGC AAGACCAGCT GCAGATCTTC ATCACCTTCG AAGAGCCCAA GGCCAAGACG GCCGACGAGG CCAAGCCGGA ACTGCCGTTC AACCCGGCGC TCCTGAAGAA GGTCGATGAG CTGGAACTGT CGGTCCGTTC GGCCAACTGC CTGAAGAACG ACAACATCGT CTATATCGGC GACCTGATCC AGAAGACCGA AGCCGAGATG CTCCGCACCC CGAACTTCGG CCGCAAGTCC TTGAACGAAA TCAAGGAAGT GCTGGCCGGC ATGGGTCTGC ACCTGGGCAT GGACGTTCCG AACTGGCCGC CGGAAAACAT CGAAGACCTG GCCAAGAAGT TCGAAGACCA GATCTAA
|
Protein sequence | MIERNWNELI RPEKPQIETG ADATRKARIV AEPLERGFGV TLGNALRRVL LSSLQGAAVT AIQIDGVVHE FSSLEGVRED VVDIVLNIKQ LAVRMHAEGP KRMTLRATGP GVVTAGQIET PSDIEILNPD HVLCTLDDGA SVRMEFTVNT GKGYVPADKN RPEDAPIGLI AVDALYSPVK RVAYRVEPTR QGQSLDYDKL ILEVETNGAV TPVDAVAYAA RILQDQLQIF ITFEEPKAKT ADEAKPELPF NPALLKKVDE LELSVRSANC LKNDNIVYIG DLIQKTEAEM LRTPNFGRKS LNEIKEVLAG MGLHLGMDVP NWPPENIEDL AKKFEDQI
|
| |