Gene Caul_0416 details


Gene Information

Locus tag: Caul_0416
Symbol:
ID: 5897690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 455945
End bp: 456994
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 66%
IMG OID: 641560902
Product: LacI family transcription regulator
Protein accession: YP_001682051
Protein GI: 167644388
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
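
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a final stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(455945, 456994)
print(length, protein_length_aa(length))  # 1050 349
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.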


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.339437
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGAGT ACGATTCCAA CCTGCCGACG CCCCATGGGC CAGCCACGAT CAAGAACGTG 
GCGCGTGCGG CCGGGGTCTC GGTGGCGACC GTCTCGCGCG CCTTGCAGAT GCCCGCGCGG
GTGGCGCCTG ACACCCGGGC CAAGGTTTCG GCGGCGGTCG AGCGGCTAGG CTACACGCCC
AATGTCCAGG CGCGCAACCT GCGAACCTCC AAGACCTCGA TGATCGTCGC CCTGGTGCCG
GACATCTCCA ACTGCTTTTT CGCCGGGGTG ATCCGGGGCA TCGAGGACGT GGCCACCCGC
AACGGCTATT CGGTCTTGCT GGGCGACATC CAGGACGACG TTTCGCGGGA GCAACGCTAC
AGCGACATGA TCTCCGCCCG CGTGGTCGAT GGCATGATCA CCCTGCTGCC GCGCGTGCCG
AAAATCCAGC GCGCCGGCCG GGCCCCTATC GTCAACGCCT GCGAATATGT CGACGACCCG
GCCATCACCA GCGTCTACAT CAACAACGAG GCGGCCGCCG GCGATGCGAC GCGCTATCTG
CTGACCCTGG GCCATCGCCA GATCGCCTTT ATCGGCGGGC CAGCTTCTAG TCCGATCAGC
ATCGATCGCA AGCGCGGCTA TGAACAGGCC TTGCTGCAGG CGGGGGTCAC CCCGTCGCGG
AAGCTCTGCG CCCAGGGCGA CTTCTCCATG GCCGCCGGCG TGCGGGGCGT GGAGTCGATC
TTCGCCGCGG GCGAGCCCTT CACCGCGGTG CTTTGCGCCA GCGACGAGAT CGCGATCGGC
GTGCTCCAAG CGGCCAAGGC GCGGGGATTT CGCGTGCCGC AGGACTTGTC GATCATCGGG
TTCGACAACA TCATCTTCTC GCAATACATG GATCCGCCCT TGACCACGGT GGCCCAACCG
CAGGAGGACC TGGGGCGCGA AGCGATGATG CTGCTGCTCA ACATCCTCGA TGAACAGGAC
ATCCCGCCGT GCAAGCGGAT CTTGTCCACG CAGCTGGTGG TGCGCGGCTC TACCGGTCCA
GCGCCGCGTC AGGCCCTCAT CGCCGACTAG
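
The GC content field can be recomputed from a sequence listed in this 10-base block layout. This is a minimal sketch, assuming GC content means the fraction of G and C bases over the total length; the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the whole-gene figure of 66%.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, for illustration only.
first_blocks = clean_sequence("ATGCTCGAGT ACGATTCCAA")
print(round(gc_fraction(first_blocks), 2))  # 0.45
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives the value to compare with the GC content field.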
 
Protein sequence
MLEYDSNLPT PHGPATIKNV ARAAGVSVAT VSRALQMPAR VAPDTRAKVS AAVERLGYTP 
NVQARNLRTS KTSMIVALVP DISNCFFAGV IRGIEDVATR NGYSVLLGDI QDDVSREQRY
SDMISARVVD GMITLLPRVP KIQRAGRAPI VNACEYVDDP AITSVYINNE AAAGDATRYL
LTLGHRQIAF IGGPASSPIS IDRKRGYEQA LLQAGVTPSR KLCAQGDFSM AAGVRGVESI
FAAGEPFTAV LCASDEIAIG VLQAAKARGF RVPQDLSIIG FDNIIFSQYM DPPLTTVAQP
QEDLGREAMM LLLNILDEQD IPPCKRILST QLVVRGSTGP APRQALIAD
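
The protein sequence follows from the gene sequence by codon translation. The sketch below assumes the standard amino acid assignments, which translation table 11 shares with the standard code; table 11 differs in its set of permitted start codons, which this sketch does not handle (the gene here begins with ATG, so that does not matter for this record). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop layout whitespace
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGCTCGAGT ACGATTCCAA CCTGCCGACG"))  # MLEYDSNLPT
```

The output matches the first block of the protein sequence; the last four codons of the gene, ATC GCC GAC TAG, likewise give IAD followed by a stop.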