Gene Caul_3779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3779 
Symbol 
ID5901241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4096200 
End bp4097219 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content68% 
IMG OID641564302 
ProductAraC family transcriptional regulator 
Protein accessionYP_001685404 
Protein GI167647741 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0505426 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACATTT TCAGCACGGA CGATCTGACG GGCCGCTCAT CTCTCGTCGA GTGGGCAGGG 
CTGTTCGCGG GGCGCATCGA GGCGATGGAT TTCAAGCCCA GCAGGCCAGA GACGTTCACG
GCCCAGCTGG CGGGCAGCGA TCTGGGTCCC CTGCATCTTG CCCGTCTCGC CTGCGCGAAA
ACCACGATCG AGCGGGGCGA AGGGCATCTC GCGCACAAGA CTTCTCCGGC CTATTTCCTG
ATGCTGCTGA TTGGCGGCAG CGGCGAGATC AGCCACTACG GCAACACGAT CACGCTCGAG
GAAGGTGATT TCGTCCTGTG CGACAGCACC GCGCCGCTGA AGATCGCGTT TCCGGATGAC
GCCGAGGCGC TCTTCCTGAA GGTCGGGGCG TCGACGCTCA AGGAGCATCT CCCGTCGCCC
GAATGCTTTT GCGGACGGGC GCTCCGCGCG GACGAGGGCC TGACGGCGAC GGCGGGGGCC
ATGGCGCTCA ATCTGTTTGG CCGCCTCGAA GCGGGGCTGG CGCTGCCCTA TCAGGACCGC
GTCGCGCGGC ACCTTCTCGA TATCCTGGCG ATGGCCTACG CCCTGGCCTT CGACACGCCC
ACGTCGCGGT CCTCGATCGT CAGCGGCCGG TGCGCGACGG TGAAGCTGTT CATCGAGCAG
CATCTACGTG ATCCCGACCT GACGCCGTGC TCGATCGCGG CGCGGATGAA GCTGTCGTCG
CGCTATCTGC GGATGATCTT CGCCAGCGAG AACGAGACGG TTTCGGCCTA CATCCTGCGC
CGGCGCCTGG AGCAGTGCGC CCGGCAGATC GCCGACCCCG CCTGGCGAGG CCACTCGATG
ACCGAGATCG CGTTCGGCTG GGGCTTCAAC AGCGCGCCCC ATTTCACCCG CACCTTCCGC
GACCGCTACG GCATGCCGCC CCGCGAATAC CGACGCCTCA AGCTGGACGA GGGGGCCGGG
GCTTTCCGGT CCGAATCGGC GCCGCGCCGC GCGGCGCAGC CCGGCGCCCG GGCGGCGTGA
 
Protein sequence
MNIFSTDDLT GRSSLVEWAG LFAGRIEAMD FKPSRPETFT AQLAGSDLGP LHLARLACAK 
TTIERGEGHL AHKTSPAYFL MLLIGGSGEI SHYGNTITLE EGDFVLCDST APLKIAFPDD
AEALFLKVGA STLKEHLPSP ECFCGRALRA DEGLTATAGA MALNLFGRLE AGLALPYQDR
VARHLLDILA MAYALAFDTP TSRSSIVSGR CATVKLFIEQ HLRDPDLTPC SIAARMKLSS
RYLRMIFASE NETVSAYILR RRLEQCARQI ADPAWRGHSM TEIAFGWGFN SAPHFTRTFR
DRYGMPPREY RRLKLDEGAG AFRSESAPRR AAQPGARAA