Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0829 |
Symbol | |
ID | 5898284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 888353 |
End bp | 889609 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641561311 |
Product | RNA polymerase ECF-subfamily sigma factor |
Protein accession | YP_001682458 |
Protein GI | 167644795 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAGA TCGCCAACAT GGACGCGGCC TGGATCGACG CGGCCCTGAC CTCGGCCCGG CCGCGGGCGG TGGCGGCGTT GCTGCGCTAT TTCTGTGATC TCGAAATCGC CGAGGAGGCC TTCCAGGACG CCTGCCTGCG GGCTCTCAAG GCCTGGCCCA GGAACGGCCC GCCGCGCGAC CCGACCGCCT GGCTGATCCT GGTGGGCCGC AACGCGGCGC TGGACGGGGT GCGCAAGCAG AGCAAGTCCT CGCCCCTGCC GCCGGAGGAG CTGATCTCCG ACCTGGAGGA CGCCGAGGCC GCCCTGGTCG ACCGGCTGGA CGGCGAACAC TATCGCGACG ACGTGCTGCG GCTGCTGTTC ATCTGCTGCC ATCCGGACCT GCCGGCCACC CAGCAGGTGG CGGTGGCCCT GCGCATCGTC TCGGGCCTGT CGGTGCGGCA GATCGCCCGC GCCTTCCTGG TCGGCGAGTC GGCGATGGAG CAGCGGATCA CTCGCGCCAA GGCCCGGATC GGGGACGGTG ACGTGCCGTT CGGCGCGCCG GACATCGAGG AGCGCGCCCG GCGGCTGACC ACCGTGGCGG CCATGGTCTA TCTGGTCTTC AACGAGGGCT ATTCGGCCGG CGGCCAGGAG ATCCAGGCGC GGGGATCGCT GTGCGACGAG GCGATCCGCC TGGCGCGGCT GCTGCTGCGG CTGTTTCCGA GCGAGCCGGA GGTCATGGGC CTGATGGCGC TGCTGCTGCT CCAGCACGCC CGCGCCCCGG CCCGGTTCGA CGCCGAGGGG GCGGTGGTGC TGCTGGAGGA CCAGGACCGG AGCCTGTGGA ACCCGCGGAT GATCGCCGAG GGCCTGGCCC TGATTGACAA GGCCATGCGC CATCGCCGGC CGGGACCCTA TCTGGTGCAG GCGGCCATCG CCGCCGAGCA CGCCCGGGCG GCGCGGGCGC GGGACACCCG CTGGGAGCGG ATCGACCGGC TGTACGGCGA CCTGGAGCAG CTGGCGCCGT CGCCGGTGGT CAGCCTGAAC CGCGCGGTGG CGGTGTCGAA AGTCGCCGGC CCCGAGGCGG CCCTGGCGAT GATCGAGCCG CTGGCTCCCA AGCTGTCGGG CTATTTCTAC TTCTTCGGCC TCAAGGGCGG CCTGCTGTTC CAGCTGGGCC GGCGGGAGGA GGCGCGCGCC GCCTTCGACC AGGCCATCGC CCTGGCCAGC ACCGCCGCCG AGGCCGCCCA CATCCGGCTG CATCTGGACC GGCTGATGAA GGGGTAG
|
Protein sequence | MNEIANMDAA WIDAALTSAR PRAVAALLRY FCDLEIAEEA FQDACLRALK AWPRNGPPRD PTAWLILVGR NAALDGVRKQ SKSSPLPPEE LISDLEDAEA ALVDRLDGEH YRDDVLRLLF ICCHPDLPAT QQVAVALRIV SGLSVRQIAR AFLVGESAME QRITRAKARI GDGDVPFGAP DIEERARRLT TVAAMVYLVF NEGYSAGGQE IQARGSLCDE AIRLARLLLR LFPSEPEVMG LMALLLLQHA RAPARFDAEG AVVLLEDQDR SLWNPRMIAE GLALIDKAMR HRRPGPYLVQ AAIAAEHARA ARARDTRWER IDRLYGDLEQ LAPSPVVSLN RAVAVSKVAG PEAALAMIEP LAPKLSGYFY FFGLKGGLLF QLGRREEARA AFDQAIALAS TAAEAAHIRL HLDRLMKG
|
| |