Gene Information |
Locus tag | Caul_3644 |
Symbol | |
ID | 5901099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3934757 |
End bp | 3935737 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641564155 |
Product | hypothetical protein |
Protein accession | YP_001685269 |
Protein GI | 167647606 |
COG category | [K] Transcription |
COG ID | [COG2378] Predicted transcriptional regulator |
TIGRFAM ID | |
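The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the gene length includes the stop codon. The record states neither convention, and the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not code for an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(3934757, 3935737)
print(length)                  # 981, matches "Gene Length"
print(protein_length(length))  # 326, matches "Protein Length"
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length rows.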
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.16096 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.19013 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGACACG ACAAGGCGAC CCGATTGCTC GACTTGGCCC GGATGCTGGC CGGATCGTCC GAAGGCATGA CCCTCGACGA GATGGCCCGC GCCATGGAGG TTGGCCGGCG CACGGCCGAG CGGATGCGCG ACGCCGTCTG GGCGGCCTTC CCGCAGATGG AGTCGATCGA CGATCCGCCG ACCAAGCGCT TCCGCATCCC CTCGGGCCTG GACAGCCTGT TCCAGACCCC GACCGCCGAG GAACTGGCCG CCCTGCGCAC CGCCGCCGAC AGCTATGCCG CCAGCGGGGC CGAGGGGCGG TCGGCCGCCC TGTACGCGCT GGAGCGCAAG CTGCTGTCGG CCCTGCGCGG CGCCGCCCGC CGCAAGGTGG CCCCGGACGT CGAGGCCCTG GTCCAGGCCG AGACCATCGC CGTCCACCCC GGTCCCCGGC CGTTCGAGAG CGAGCAGGTC CTTTCGGCCA TTCGCACCGC CGTGAAGAGC CTGCAGGCCT TGTCGTTCCG GTACGAGGGC GGCTCCGCGC CCGGCCGGAC CCGCAAGGTG ACGCCGCTCG GCATCCTGTT CGGCCACAGC AACTATCTGG TGGCGACGGA GGGACGTGAT CCCAAGCCCC GAACCTTCCG TCTCGATCGC ATGCAGGCCG TCACGGCGCT CGACGAGCCC GCCCCGCCGC CGGAAGATTT CTCGTTGCAG GCCTTCGCCG ACGAGAGCTT CGGCATCTAT CACGGCGAGG TCGAGGACGT GGTTCTGCGC GTCAAGCCCG GCCGCGCCGA CGACGCCCTG CGCTGGCGCT TCCACCCCAG CCAGACCGTG ACCCAGGAGG CGGACGGGTC GGTGATCGTC GCCTTCCGCG CCAGCGGCAT GCTGGAACTG TCCTGGCACC TGTTCACCTG GGGCGACGCG GTTGAGATCC TCTCGCCGCC GGGACTGCGC GCAATGATGG TTGAGGAGCT GAAGACCGCC CTGCGAGCCC ATGAGGGCTA G
Protein sequence | MRHDKATRLL DLARMLAGSS EGMTLDEMAR AMEVGRRTAE RMRDAVWAAF PQMESIDDPP TKRFRIPSGL DSLFQTPTAE ELAALRTAAD SYAASGAEGR SAALYALERK LLSALRGAAR RKVAPDVEAL VQAETIAVHP GPRPFESEQV LSAIRTAVKS LQALSFRYEG GSAPGRTRKV TPLGILFGHS NYLVATEGRD PKPRTFRLDR MQAVTALDEP APPPEDFSLQ AFADESFGIY HGEVEDVVLR VKPGRADDAL RWRFHPSQTV TQEADGSVIV AFRASGMLEL SWHLFTWGDA VEILSPPGLR AMMVEELKTA LRAHEG
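The relationship between the two sequence rows can be sketched as a codon-by-codon translation. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is already given in coding orientation: the gene is on the minus strand, but the listed sequence starts with ATG and ends with TAG, so no reverse complement is applied. It also uses the standard amino acid assignments, which translation table 11 shares for the codons within a reading frame. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGCGACACG ACAAGGCGAC CCGATTGCTC"))  # MRHDKATRLL
# Last 12 bases: three codons followed by the TAG stop.
print(translate("CATGAGGGCTAG"))                      # HEG
```

The two printed fragments match the first block and the final residues of the protein sequence row. Passing the full gene sequence to `translate` and `gc_fraction` would allow comparison with the complete protein sequence and the listed GC content.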