Gene Caul_2998 details


Gene Information

Locus tag: Caul_2998
Symbol:
ID: 5900453
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3261698
End bp: 3263122
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 68%
IMG OID: 641563495
Product: XRE family transcriptional regulator
Protein accession: YP_001684623
Protein GI: 167646960
COG category: [R] General function prediction only
COG ID: [COG3800] Predicted transcriptional regulator
TIGRFAM ID:
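
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 3261698
end_bp = 3263122

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1425
print(protein_length)  # 474
```

Under these assumptions the results agree with the listed gene length (1425 bp) and protein length (474 aa).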


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAAA AGCTTTTCAT CGGTCCAAAG CTTCGCACCC TGCGCCTGGC CAAGGGGTGG 
ACGCTGGATG TGTGCGCCGG TCGCCTGGGC CTGTCGGCCA GCTATCTGTC GCAGATCGAA
GTCAACCAGC GTCCGGTCAC CGCGCGCGTG CTGATCGACG TGATGCGCGT GTTCGAGGTC
GACGCCGCAT CCCTTGACGC GGTCGATGAC CACCGTCTGA TCGCCGACCT GCGCGAGGCC
GCCGCCGACG GGATTCAGGG CGGCGTCGCG CCGGGCCTGC AGGAACTGAA ATCGGCGGTG
GCCAACACGC CCAATCTGGC GCGCTCCTAC CTGGCGCTGC ATCACGCCTA TCGGCGGCTC
GATGAGCGGC TGAAGATCAC CGAGGAGGCG GTCTCGCTCG ACGAGCGTGG CGCGGCCAGC
GCGCTCTTGC CCTACGAGGA GGTGCGCGAC TTCTTTCACT ACAAGAACAA CTATATCCAT
AGCTTGGACG TCGCGGCCGA GGGTCTGGCC TTGAGCCCCG ACGACGGCGA GACCACCGAG
ACGGCGCTGG AGCGACGGCT CGGCGCGCTG GGCGTCAAGG TGCGGCGCTC GGCTGAACTG
GACCTGCTTC GGCGGTTCGA CCCCGCGGCC GGCCTGCTGG TGCTGGGTTC GGCCCATTCT
AGCGCCACGC GTTGTTTCCA GATGGCCTAT CAGATCGCCG CCGCGACCCT GGCGGAAACG
GTCGAGGCGG AGTTAGCGGC GGCCGGGTTT CGCAGCGACA GCGCCGCCAA GGTCTGCCGC
ACCGGCCTGT TGAACTACGC GGCTGGAGCG ATGCTCCTGC CGTATGAACG GTTCCGCGAG
GCGGCTCGCG CGACACGGCA CGACATCGAG CGGCTGAGCC TGATGTTCCA GACCAGCCTC
GAGCAGGTGT TCCATCGCCT CAGCACCCTG CAGAGGCCGG GCGCGCGGGG CTTGCCGTTC
TACTTCGTTC GCGTCGACCA GGCCGGCAAC ATCACCAAGC GCCACAGCGC CACCCGCCTG
CAGTTCGCGC GCTTTGGCGG AACCTGCCCG TTGTGGAACG TGCATGACGC CTTCGCCCGG
CCCGACAAGT GGCTGGTGCA ACTGGCCGAG ATGCCCGACG GCGTGCGCTA TGTGAGCATC
GCCAAGGGGG TGGTCAAACC CTCGGGTTCC TATCTGCGTC CCGACAGGCG CTACGCCCTG
GGCCTGGGGT GCGAGACCCA GTACGCGGAC CAGCTGGTCT ATGCGCAGGG CCTGGACCTG
GCGGGACCGC CCGCGCCCAT CGGCGTCAGC TGCCGCATCT GCGAACGCGA CGACTGCGCC
CAACGCGCCT TTCCGCCCGT GGACCGAGAT TTCGAGGTGT TCGAGAACGA GCGGCGTCTT
GTGCCGTTCA CGCTCCGCAC CGTTGACGGC CATCCCTCGG CCTGA
 
Protein sequence
MSEKLFIGPK LRTLRLAKGW TLDVCAGRLG LSASYLSQIE VNQRPVTARV LIDVMRVFEV 
DAASLDAVDD HRLIADLREA AADGIQGGVA PGLQELKSAV ANTPNLARSY LALHHAYRRL
DERLKITEEA VSLDERGAAS ALLPYEEVRD FFHYKNNYIH SLDVAAEGLA LSPDDGETTE
TALERRLGAL GVKVRRSAEL DLLRRFDPAA GLLVLGSAHS SATRCFQMAY QIAAATLAET
VEAELAAAGF RSDSAAKVCR TGLLNYAAGA MLLPYERFRE AARATRHDIE RLSLMFQTSL
EQVFHRLSTL QRPGARGLPF YFVRVDQAGN ITKRHSATRL QFARFGGTCP LWNVHDAFAR
PDKWLVQLAE MPDGVRYVSI AKGVVKPSGS YLRPDRRYAL GLGCETQYAD QLVYAQGLDL
AGPPAPIGVS CRICERDDCA QRAFPPVDRD FEVFENERRL VPFTLRTVDG HPSA
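
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example and not the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is built from the standard NCBI amino acid string, which translation tables 1 and 11 share (they differ in the permitted start codons, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
# Translation tables 1 and 11 use the same amino acid assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above (60 bases, 20 codons).
first_line = "ATGAGCGAAA AGCTTTTCAT CGGTCCAAAG CTTCGCACCC TGCGCCTGGC CAAGGGGTGG"
print(translate(first_line))  # MSEKLFIGPKLRTLRLAKGW
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the listed GC content.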