Gene Caul_2141 details


Gene Information

Locus tag: Caul_2141
Symbol:
ID: 5899596
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2316260
End bp: 2317282
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 71%
IMG OID: 641562631
Product: LacI family transcription regulator
Protein accession: YP_001683767
Protein GI: 167646104
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
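
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2316260
end_bp = 2317282

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1023 bp) and Protein Length (340 aa).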


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0418584
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0217452
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACGA TCCACGATGT GGCGCTGCAA GCGGGCGTGT CGCCAAAGAC CGTCTCGCGG 
GTGCTGAACG ATCACGAGAG CGTCACCGCC AAGACCCGCG AGCGCGTACG CGGCGCCATG
CAGGCCCTCG ACTATCATCC CAACGCCGTG GCGCGCGGCC TGCGCTCGCA CGCCGCCCCG
GCCGTCGGCA TCCTGATGGG CGACCCCAGC GGCGGCTACC AGACCCGCAT CCACCACGCC
CTGATGGTCG CCTGCCTGCA GAACGGCCGC CACCTGTCGG CCGAGTTGGT CGAGGGCGAC
ATGGCCGGCT GGCAGGATCG CATCCGCGCC TTCGTCACCG AGGGCGGGAT CCGCGAGATG
ATCCTGCTGC CGCCCGAATG CGACTTCGCC CCGCTCAAGA CGCTGCTGCG CGAGCACGAC
GTGCGCTGTG TGCTGATCTC GCCCACCAGC CCCGATTCGC AATCGCCCAG CATCGTGATG
GACGACCGCG CCGCCGCGCG CGAGGTGGTC GAGCACCTGT TCAGCCTGGG CCATGAACGG
ATCGGCCATA TCGCCGGCCA CCCGGACCAC GCCGCCAGCA CCCTGCGCCG CAATGGCTTC
AACGAGGCCT ACGCCGCCGC CGGCAAGCCG CGCCCCGATC CGGCGCTGAT CGTACCCGGC
GACTTCACGT TCAAGGGCGG CCTGGCCGGC GCCCAGGCCC TGCTGGACAT GGAAAACCCG
CCGACCGCCA TCTTCGCGGC CAATGACGAC ATGGCGGCCG CCACCTGCAT GGAGGCCCAG
CGCCGCGGCC TGCGCATTCC CGACGACCTG TCCGTGGTCG GGTTCGACGA CGCGCCGATC
GCCGCCGCGA TCTGGCCGTC CCTGACCACG ATCCGCCAGC CCTTCGACCA GATGACCCAG
CGGGCCATCA CCGCCCTCGG CGCCTGGAAC GCCAACGCGG CGCTCGGCAA GTCGGCGGCG
ACGATCCTGA CAAAGCACAG TCTGGTCGTC CGCGAATCCA CCGGCCCCGT CAGGGCCGGG
TAA
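
A GC content figure such as the one listed under Gene Information can in principle be recomputed from the gene sequence. The sketch below assumes GC content means the fraction of G and C among all A, C, G, T bases, and that the spaces grouping the sequence into blocks of ten are layout only; `gc_fraction` is a hypothetical helper, and only the first row of the sequence is used as sample input.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G/C among the A, C, G, T characters in seq."""
    # Ignore spaces and newlines used to group the sequence for display.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# First row of the gene sequence above, used here only as sample input.
first_row = "ATGGCGACGA TCCACGATGT GGCGCTGCAA GCGGGCGTGT CGCCAAAGAC CGTCTCGCGG"
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full gene sequence as a single string would give the value to compare against the listed GC content.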
 
Protein sequence
MATIHDVALQ AGVSPKTVSR VLNDHESVTA KTRERVRGAM QALDYHPNAV ARGLRSHAAP 
AVGILMGDPS GGYQTRIHHA LMVACLQNGR HLSAELVEGD MAGWQDRIRA FVTEGGIREM
ILLPPECDFA PLKTLLREHD VRCVLISPTS PDSQSPSIVM DDRAAAREVV EHLFSLGHER
IGHIAGHPDH AASTLRRNGF NEAYAAAGKP RPDPALIVPG DFTFKGGLAG AQALLDMENP
PTAIFAANDD MAAATCMEAQ RRGLRIPDDL SVVGFDDAPI AAAIWPSLTT IRQPFDQMTQ
RAITALGAWN ANAALGKSAA TILTKHSLVV RESTGPVRAG