Gene Caul_2640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2640 
Symbol 
ID5900095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2865893 
End bp2866882 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content68% 
IMG OID641563131 
ProductAraC family transcriptional regulator 
Protein accessionYP_001684265 
Protein GI167646602 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.708344 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGATT TCCCAACTCG TCCCCGCAAG ATCGCGATCG TCGGTTATCG CGGCGCGCAA 
TCGCTCGATA TCAACGGGCC TTTCGAGGTG TTCGCCATGG CAAACCGGTT CGGCGGCGTG
ACCGTTTACG AACCGATCCT GGCCTCGCCC CACGGCGGCG CGATCGTCTG CAATTCCGGG
CTCGGCATCG CGGGTTCGGT GGCCTTCGCC GACCTTCCCA CCGACCTCGA TACGATCCTG
GTCGCCGGCG GGGACGAAGA GGGCCTGCTG GGGATGCGCG ACGCCAACGT TCTCGAATGG
CTGACCGAGC GGGCTCGGTC CACGCGGCGC GTGGGCAGCG TTTGCTCGGG CGCGTTCGTG
CTGGCCGCGG CGGGGATGCT GGACGGCCGG CGCGCCACGA CCCACTGGGA AGTCTGCGAC
GAGATGCGCG CCTTTCGACC GGCCGTGAGG TTGGAGCCGG ATGCGATCTT CGTGGCCGAT
CCGCCGTACT ACACGTCGGC GGGCGTGACG GCCGGCATCG ATCTTTGCCT GTCCTTCGTG
GAGGAGGACT GTGGACCGGA GCTGGCGCTG GCGATCGCTC GCAATCTCGT CCTCTTCATG
CGCCGGCCGG GCGGGCAGAC GCAGTACAGC ACCGGGCTCA ATGTGCAGGT CGCGGCCACG
CCGAAGCTGC GCAGCCTGAT CGCCGAGATC AGCGCCGATC CCGGCGGCGA CCAGACGCTG
CCAAGCCTCG CCGACAAGGC CGGCATGACC GAGCGAACGT TCAGCCGCGT CTTCCACAAG
GAGACCGGAA CCACTCCGGC GGCGTTCGTG GAAATGGCCC GGGTCAACCG CGCCAAGGCT
TTGCTGGAAA CCTCCGACTG GCCGCTGGCG CGTGTCGCCG AGCGCTCGGG CTTTGGCAGC
CTGGACGCGC TGCATCGGGC CTTTCAAAAA CGCGTTGGGG CGACGCCGGG CGACTATCGG
GCTCGGTTCG GCCGCCAACC GGCTCAGTAG
 
Protein sequence
MPDFPTRPRK IAIVGYRGAQ SLDINGPFEV FAMANRFGGV TVYEPILASP HGGAIVCNSG 
LGIAGSVAFA DLPTDLDTIL VAGGDEEGLL GMRDANVLEW LTERARSTRR VGSVCSGAFV
LAAAGMLDGR RATTHWEVCD EMRAFRPAVR LEPDAIFVAD PPYYTSAGVT AGIDLCLSFV
EEDCGPELAL AIARNLVLFM RRPGGQTQYS TGLNVQVAAT PKLRSLIAEI SADPGGDQTL
PSLADKAGMT ERTFSRVFHK ETGTTPAAFV EMARVNRAKA LLETSDWPLA RVAERSGFGS
LDALHRAFQK RVGATPGDYR ARFGRQPAQ