Gene Caul_4265 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4265 
Symbol 
ID5901726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4636092 
End bp4638041 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content65% 
IMG OID641564784 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_001685884 
Protein GI167648221 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.914604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.695759 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGA CGACGACTGA CGGAGCCGAA CCGCCCGAAA CCGGTGGCGG GGATGGCCCC 
CTGCTCGACC TGACCGACGC CGGCGTCAAG AAGTTCATCA AGCAGGCCAA GGCCCGCGGC
TACGTCACCA TGGACGAGCT GAACAAGGTC CTGCCGTCGG AAGAAGTGAC CTCCGAGGCC
ATCGAAGACA CCCTCGCCAT GCTGAGCGAG ATGGGCGTCA ACGTGGTCGA GGCCGAAGAA
GACGCTGAAA CCTCCGAGGG CGGCGAAGTC GTCGCCCGCG AGGACAACCA GCAGGTCATC
ACCAGCGACA AGCCGGCGGC CTATGACCGC ACGGACGATC CCGTGCGCAT GTACCTGCGC
GAGATGGGCA GCGTCGAGCT GCTGTCGCGC GAGGGCGAAA TCGCCATCGC CAAGCGCATC
GAGGCCGGCC GCGACACCAT GATCCGGGGT CTGTGCGAAA GCGCCCTGAC CTTCGAAGCC
ATCATGGTCT GGCGCGAGGA ACTGGGCACC GGCCGCATCC TGCTGCGCGA AGTGATCGAC
CTGGAAGGCA CCTACGCCGC CATCAACGGC GTGGCCGCCG CGCCCGCCGC CGACGACGAG
CCGGCCGAGC CGTCGGACGA GGAACCCGGC AGCGAACCCA AGGCCGAGGG CGAGGAAGAC
GAGGACGATT TCGACGATGG CGCCGGCCCG ACGGTCAGCG CCATGGAAGG CGAACTGCGC
GAAGGCGTGA TGGCCATCCT CGACGCGATC GCCAGCGAGT TCGAAACCTT CCGCAAGCTG
CAGGACAAGT TGGTCGGCAG CCGCCTGAAG GGCGCGGACC TCTCGGACGC CGACCGCAAG
GCCTATGAGG GCCTGTCGAA CACCATCATC CAGCACCTCA AGACGCTGAA GCTGAACAAC
AACCGCATCG AGGCGCTGGT CGAGCAGCTC TATGCGATCA ACAAGCGCCT GATCGGCCTG
GAAGGCCGGC TGCTGCGCCT GGCCGACAGC TACGGCATCA GCCGCAGCGA GTTCCTCAAG
GCCTATTTCG GTTCGGAGCT GAACCCGAAC TGGATCGACC AGGTCAAGGC CATGGGCGTG
CGCTGGACCA AGTTCGTCGA GAACGACACC AACTCGGTCG GCGACATCCG TCAGGAAATC
GCCGCCCTGG CCACCGAGAC CGGCGTGCCG ATCGACGATT ATCGCCGCAT CGTCCAGACC
GTGCAGAAGG GCGAACGCGA GGCCCGGCAG GCCAAGAAGG AGATGGTGGA AGCCAATCTC
CGTCTCGTGA TCTCGATCGC CAAGAAGTAC ACCAACCGCG GCCTGCAGTT CCTGGACCTG
ATCCAGGAAG GCAATATCGG CCTGATGAAG GCTGTCGATA AGTTCGAATA TCGCCGCGGC
TACAAGTTCT CGACCTACGC CACCTGGTGG ATCCGTCAGG CGATCACCCG CTCGATCGCC
GACCAGGCGC GGACCATCCG CATCCCGGTG CACATGATCG AGACGATCAA CAAGATCGTC
CGCACCAGCC GCCAGATGCT GCACGAGATC GGCCGCGAGC CGACCCCGGA AGAGCTGGCC
GAAAAACTGG CCATGCCGCT GGAGAAGGTG CGCAAGGTCC TGAAGATCGC CAAGGAGCCG
ATTTCGCTCG AAACCCCGAT CGGGGACGAG GAAGACAGCC ACCTGGGCGA CTTCATCGAG
GACAAGAACG CCATCCTGCC GATCGACGCG GCCATCCAGT CCAACCTGCG GGAAACCACC
ACCCGCGTGC TGGCCTCGCT GACCCCGCGT GAAGAGCGCG TCCTGCGCAT GCGTTTTGGC
ATCGGCATGA ACACCGACCA CACCCTGGAA GAGGTGGGCC AGCAGTTCTC GGTCACCCGC
GAACGCATCC GCCAGATCGA GGCCAAGGCC CTGCGCAAGC TCAAGCACCC GAGCCGGTCG
CGCAAGCTGC GGTCGTTCCT GGACTCCTAA
 
Protein sequence
MSTTTTDGAE PPETGGGDGP LLDLTDAGVK KFIKQAKARG YVTMDELNKV LPSEEVTSEA 
IEDTLAMLSE MGVNVVEAEE DAETSEGGEV VAREDNQQVI TSDKPAAYDR TDDPVRMYLR
EMGSVELLSR EGEIAIAKRI EAGRDTMIRG LCESALTFEA IMVWREELGT GRILLREVID
LEGTYAAING VAAAPAADDE PAEPSDEEPG SEPKAEGEED EDDFDDGAGP TVSAMEGELR
EGVMAILDAI ASEFETFRKL QDKLVGSRLK GADLSDADRK AYEGLSNTII QHLKTLKLNN
NRIEALVEQL YAINKRLIGL EGRLLRLADS YGISRSEFLK AYFGSELNPN WIDQVKAMGV
RWTKFVENDT NSVGDIRQEI AALATETGVP IDDYRRIVQT VQKGEREARQ AKKEMVEANL
RLVISIAKKY TNRGLQFLDL IQEGNIGLMK AVDKFEYRRG YKFSTYATWW IRQAITRSIA
DQARTIRIPV HMIETINKIV RTSRQMLHEI GREPTPEELA EKLAMPLEKV RKVLKIAKEP
ISLETPIGDE EDSHLGDFIE DKNAILPIDA AIQSNLRETT TRVLASLTPR EERVLRMRFG
IGMNTDHTLE EVGQQFSVTR ERIRQIEAKA LRKLKHPSRS RKLRSFLDS