Gene Caul_2074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2074 
Symbol 
ID5899529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2220431 
End bp2221444 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content70% 
IMG OID641562563 
ProductLacI family transcription regulator 
Protein accessionYP_001683700 
Protein GI167646037 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.243502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.538933 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTGT CGCGAGTGAT CAATGATGGC GCCACGGTCC GGGAATCGAC GCGCATGGCG 
GTCCTGGCCG CGATCCGCGA ACTGAACTAC GAACCAAACC TCGCGGCCCG CAACCTCGTC
ATGGCCGGAG AACTGCGGAT CGGGGTGATC TATTCCAACC CCAGCGCCGC TTTCATGAGC
GACTTCCTGG TCGGTGTGTT CGAGGAGGCC ACCAGCGCCG GGGCCAGGCT GATCCTGGTG
CGAGGCGAAA GGGGCCAGGT CCCGACGCCC GAGGAACTGC AGAGGCTCCT GGCGTCCGGC
GTCCACGGCG TGGTCCTGGC GCCCCCCCTG GGCGATTCAG CCCTCGTGCG CGACATGTTC
CGCGCCGCCA ATCTGCCCGT CGCCGTGGTC GCCGCGGGAC GGCCGCCGGC CGACGCGATC
AACGTCCGCA TCGACGACCA CCAGGCCAGC CAGGCCATGG TGCAGCATCT GCTGAATCTC
GGCCATCGAA GGATCGGTTT CATCGCCGGC AACCCTGACC AGAGCGCCAG CGCCGAGCGC
CTTGAGGGCG CGCGCGCGGC GATCGCGGCG GTCGAGGGGG CTGAACTCGT CCTGGCCCAG
GGCACCTTCA CCTACGGTTC GGGTTTGCGC GCCGCCGAGT GGTTGCTCGA TTCCGATCCG
CCGCCCACCG CGATCTTCGC CAGCAACGAC GACATGGCCG CCGCGGCCGT GTCGGTGGCC
CACCGTCGGC ACCTCGACGT GCCGCGCGAC CTGACCGTGG TCGGGTTCGA TGACACCACC
GTGGCCACCA CCCTTTGGCC GCCGCTGACC ACCATACGCC AGCCGGTGCG GCAGATGGCG
GCCGTGGCCC TGGACCGGCT GATGCGCGCC TTGCGATCGG CCGAGCCCAT GGCCGAAGCT
TCCGCCGACT ATGTCCTGGG CCACGCTCTC ATCGAGCGCG AGTCCACCGC CCCGCCCCGG
CGCGCCACCC GGATCGCGGG CGCGAAAACT CACCAGAGGT CATCGCATGC CTGA
 
Protein sequence
MTVSRVINDG ATVRESTRMA VLAAIRELNY EPNLAARNLV MAGELRIGVI YSNPSAAFMS 
DFLVGVFEEA TSAGARLILV RGERGQVPTP EELQRLLASG VHGVVLAPPL GDSALVRDMF
RAANLPVAVV AAGRPPADAI NVRIDDHQAS QAMVQHLLNL GHRRIGFIAG NPDQSASAER
LEGARAAIAA VEGAELVLAQ GTFTYGSGLR AAEWLLDSDP PPTAIFASND DMAAAAVSVA
HRRHLDVPRD LTVVGFDDTT VATTLWPPLT TIRQPVRQMA AVALDRLMRA LRSAEPMAEA
SADYVLGHAL IERESTAPPR RATRIAGAKT HQRSSHA