Gene Caul_4958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4958 
Symbol 
ID5902420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5360746 
End bp5362254 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content68% 
IMG OID641565479 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_001686576 
Protein GI167648913 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCTCTCG GCCACAGATT GGAGCTTCGG CAAGGCCAAG GCCTTGTCAT CACGCCGCAG 
CTTCAGCAGG CGATCAAGCT GCTGCAGCTG TCGAACCTCG AGCTCGACGC CTTCGTCGAG
GCCGAGCTCG AGCGCAACCC CCTGCTCCAG CGCGAAGACG GTCCGGTCGA GGCCGAACGC
GCGCCTGAAG ACGTCGAGCG CGCGCCCGAG AACCTCGGCC TGGACTCGGT GTCCGACAGC
GCCGCCAGCG TCCAGATGGA CGCCACGCCC GACGACGTCT CGCCCGGCGA GCGCGCCACC
GGCGACGGCC CGGACTCTGA AGGTGGAGCC GAGCAGGCCG GCGGCCAGAT CGACTGGTCG
CGAACCGGCG GCGGCGGCAA TTTCGAGAGC GACGACAATT ACGAAAGCGC CCTGTCGCGC
GCCCCAACCC TTTGCGAGCA CCTGACCGAG CAGTTGGCCG TGGCCGGCCT GACCGCCGCC
CAGAACGTGG TGGCCGGGGT GCTGATCGAC GCGGTCGATG AGACCGGCTA CCTGCGCGCC
GACCTGCTCG AAGTGGCCGA GCGCCTGGGC TGCGGCCTGG ATTTCGTGGA AGGCGTGCTG
ACCGTCCTGC AAGGCTTCGA GCCCGCCGGC GTCATGGCCC GCGACGTGCG CGAGTGCCTG
GCCCTGCAGC TGAAGGACGT CAATCGCTAC GACCCGTGCA TGGCCGCCCT GCTCGACAAT
CTCGACCTGC TGGCCAAGCG CGACATGGCG TCCCTGCGCA AGGTCTGCGG CGTGGACGAC
GAGGACCTGC GCGAGATGGT CGCGGAGATC CGCGCCCTGA CCCCTCGCCC CGGCGCGGCC
TTCCACGCGG ACCCGCCCCA GACCCTGGTG CCCGACGTCC ATGTCCGCGA GACGCCCGGC
GGCCTGTGGC ACGTCGAGCT GAACACCGAC ACCCTGCCCC GGGTTCTGGT CGACCAGCGC
TACCACGCCC GGGTCAGCAA GGGCGCGCGC ACCGACCAGG AAAAGACCTT CGTCTCCGAC
AGCTTCGCCA CCGCCAACTG GCTGGTCAAG AGTCTCGACC AGCGGGCCAA GACCATCCTG
AAGGTGTCCA GCGAGATCGT CCGCCAGCAG GACGGCTTCC TGGCCTATGG CGTCGAACAT
CTGCGCCCGC TGAACCTTAA GACCGTCGCC GACGCCATCG GCATGCACGA GAGCACCGTC
AGCCGGGTGA CCTCCAACAA GTACATCGCC ACCCCGCGCG GGGTGTTCGA ACTGAAATAT
TTCTTCACCT CGGCCATCCA GTCCAGCGAG GGCGGCGAGG CCCATTCGGC CGCCAGCGTG
CGCTACAAGA TCAAGGGCCT GGTCGAGGCC GAGCGCACCG AGGCCGACGT CCATTCCGAT
GACCGCATCG TCGAGATCCT CAAGGACGCC GGCGTCGACA TCGCCCGCCG GACGGTGGCC
AAGTATCGAG AAGCCCTGCG TATTCCGTCC TCCGTCGAGC GTCGCCGCTT GATCAAGGAA
ACCGCCTGA
 
Protein sequence
MALGHRLELR QGQGLVITPQ LQQAIKLLQL SNLELDAFVE AELERNPLLQ REDGPVEAER 
APEDVERAPE NLGLDSVSDS AASVQMDATP DDVSPGERAT GDGPDSEGGA EQAGGQIDWS
RTGGGGNFES DDNYESALSR APTLCEHLTE QLAVAGLTAA QNVVAGVLID AVDETGYLRA
DLLEVAERLG CGLDFVEGVL TVLQGFEPAG VMARDVRECL ALQLKDVNRY DPCMAALLDN
LDLLAKRDMA SLRKVCGVDD EDLREMVAEI RALTPRPGAA FHADPPQTLV PDVHVRETPG
GLWHVELNTD TLPRVLVDQR YHARVSKGAR TDQEKTFVSD SFATANWLVK SLDQRAKTIL
KVSSEIVRQQ DGFLAYGVEH LRPLNLKTVA DAIGMHESTV SRVTSNKYIA TPRGVFELKY
FFTSAIQSSE GGEAHSAASV RYKIKGLVEA ERTEADVHSD DRIVEILKDA GVDIARRTVA
KYREALRIPS SVERRRLIKE TA