Gene Caul_0829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0829 
Symbol 
ID5898284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp888353 
End bp889609 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content72% 
IMG OID641561311 
ProductRNA polymerase ECF-subfamily sigma factor 
Protein accessionYP_001682458 
Protein GI167644795 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAGA TCGCCAACAT GGACGCGGCC TGGATCGACG CGGCCCTGAC CTCGGCCCGG 
CCGCGGGCGG TGGCGGCGTT GCTGCGCTAT TTCTGTGATC TCGAAATCGC CGAGGAGGCC
TTCCAGGACG CCTGCCTGCG GGCTCTCAAG GCCTGGCCCA GGAACGGCCC GCCGCGCGAC
CCGACCGCCT GGCTGATCCT GGTGGGCCGC AACGCGGCGC TGGACGGGGT GCGCAAGCAG
AGCAAGTCCT CGCCCCTGCC GCCGGAGGAG CTGATCTCCG ACCTGGAGGA CGCCGAGGCC
GCCCTGGTCG ACCGGCTGGA CGGCGAACAC TATCGCGACG ACGTGCTGCG GCTGCTGTTC
ATCTGCTGCC ATCCGGACCT GCCGGCCACC CAGCAGGTGG CGGTGGCCCT GCGCATCGTC
TCGGGCCTGT CGGTGCGGCA GATCGCCCGC GCCTTCCTGG TCGGCGAGTC GGCGATGGAG
CAGCGGATCA CTCGCGCCAA GGCCCGGATC GGGGACGGTG ACGTGCCGTT CGGCGCGCCG
GACATCGAGG AGCGCGCCCG GCGGCTGACC ACCGTGGCGG CCATGGTCTA TCTGGTCTTC
AACGAGGGCT ATTCGGCCGG CGGCCAGGAG ATCCAGGCGC GGGGATCGCT GTGCGACGAG
GCGATCCGCC TGGCGCGGCT GCTGCTGCGG CTGTTTCCGA GCGAGCCGGA GGTCATGGGC
CTGATGGCGC TGCTGCTGCT CCAGCACGCC CGCGCCCCGG CCCGGTTCGA CGCCGAGGGG
GCGGTGGTGC TGCTGGAGGA CCAGGACCGG AGCCTGTGGA ACCCGCGGAT GATCGCCGAG
GGCCTGGCCC TGATTGACAA GGCCATGCGC CATCGCCGGC CGGGACCCTA TCTGGTGCAG
GCGGCCATCG CCGCCGAGCA CGCCCGGGCG GCGCGGGCGC GGGACACCCG CTGGGAGCGG
ATCGACCGGC TGTACGGCGA CCTGGAGCAG CTGGCGCCGT CGCCGGTGGT CAGCCTGAAC
CGCGCGGTGG CGGTGTCGAA AGTCGCCGGC CCCGAGGCGG CCCTGGCGAT GATCGAGCCG
CTGGCTCCCA AGCTGTCGGG CTATTTCTAC TTCTTCGGCC TCAAGGGCGG CCTGCTGTTC
CAGCTGGGCC GGCGGGAGGA GGCGCGCGCC GCCTTCGACC AGGCCATCGC CCTGGCCAGC
ACCGCCGCCG AGGCCGCCCA CATCCGGCTG CATCTGGACC GGCTGATGAA GGGGTAG
 
Protein sequence
MNEIANMDAA WIDAALTSAR PRAVAALLRY FCDLEIAEEA FQDACLRALK AWPRNGPPRD 
PTAWLILVGR NAALDGVRKQ SKSSPLPPEE LISDLEDAEA ALVDRLDGEH YRDDVLRLLF
ICCHPDLPAT QQVAVALRIV SGLSVRQIAR AFLVGESAME QRITRAKARI GDGDVPFGAP
DIEERARRLT TVAAMVYLVF NEGYSAGGQE IQARGSLCDE AIRLARLLLR LFPSEPEVMG
LMALLLLQHA RAPARFDAEG AVVLLEDQDR SLWNPRMIAE GLALIDKAMR HRRPGPYLVQ
AAIAAEHARA ARARDTRWER IDRLYGDLEQ LAPSPVVSLN RAVAVSKVAG PEAALAMIEP
LAPKLSGYFY FFGLKGGLLF QLGRREEARA AFDQAIALAS TAAEAAHIRL HLDRLMKG