Gene Caul_1638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1638 
Symbol 
ID5899093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1718899 
End bp1719915 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content63% 
IMG OID641562127 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_001683265 
Protein GI167645602 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.145818 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0211449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGAAA GAAACTGGAA CGAGCTGATC CGTCCTGAGA AGCCGCAGAT CGAGACTGGC 
GCCGACGCGA CCCGCAAGGC TCGCATCGTG GCCGAGCCGC TCGAACGCGG TTTCGGTGTG
ACGCTCGGCA ACGCTCTCCG CCGTGTCCTG CTCTCGTCGC TTCAGGGCGC CGCCGTCACC
GCCATCCAGA TCGATGGCGT GGTGCACGAA TTCTCCTCGC TCGAGGGCGT CCGCGAAGAC
GTCGTCGACA TCGTTCTGAA CATCAAGCAA CTGGCCGTGC GCATGCACGC CGAAGGCCCC
AAGCGCATGA CCCTGCGCGC CACGGGTCCC GGCGTCGTGA CCGCGGGTCA GATCGAAACG
CCTTCGGACA TCGAGATCCT GAACCCCGAC CACGTGCTCT GCACGCTGGA CGACGGCGCT
TCGGTGCGCA TGGAGTTCAC GGTCAACACC GGCAAGGGCT ACGTCCCTGC CGACAAGAAC
CGTCCGGAAG ACGCGCCGAT CGGCCTGATC GCCGTCGACG CCCTCTACAG CCCGGTCAAG
CGCGTGGCTT ACCGCGTCGA GCCGACCCGT CAGGGTCAAT CGCTCGACTA TGACAAGCTG
ATCCTGGAAG TCGAAACCAA CGGCGCCGTC ACGCCGGTGG ACGCGGTGGC CTACGCCGCC
CGCATCCTGC AAGACCAGCT GCAGATCTTC ATCACCTTCG AAGAGCCCAA GGCCAAGACG
GCCGACGAGG CCAAGCCGGA ACTGCCGTTC AACCCGGCGC TCCTGAAGAA GGTCGATGAG
CTGGAACTGT CGGTCCGTTC GGCCAACTGC CTGAAGAACG ACAACATCGT CTATATCGGC
GACCTGATCC AGAAGACCGA AGCCGAGATG CTCCGCACCC CGAACTTCGG CCGCAAGTCC
TTGAACGAAA TCAAGGAAGT GCTGGCCGGC ATGGGTCTGC ACCTGGGCAT GGACGTTCCG
AACTGGCCGC CGGAAAACAT CGAAGACCTG GCCAAGAAGT TCGAAGACCA GATCTAA
 
Protein sequence
MIERNWNELI RPEKPQIETG ADATRKARIV AEPLERGFGV TLGNALRRVL LSSLQGAAVT 
AIQIDGVVHE FSSLEGVRED VVDIVLNIKQ LAVRMHAEGP KRMTLRATGP GVVTAGQIET
PSDIEILNPD HVLCTLDDGA SVRMEFTVNT GKGYVPADKN RPEDAPIGLI AVDALYSPVK
RVAYRVEPTR QGQSLDYDKL ILEVETNGAV TPVDAVAYAA RILQDQLQIF ITFEEPKAKT
ADEAKPELPF NPALLKKVDE LELSVRSANC LKNDNIVYIG DLIQKTEAEM LRTPNFGRKS
LNEIKEVLAG MGLHLGMDVP NWPPENIEDL AKKFEDQI