Gene Caul_3970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3970 
Symbol 
ID5901432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4299033 
End bp4300049 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content69% 
IMG OID641564491 
ProductAraC family transcriptional regulator 
Protein accessionYP_001685593 
Protein GI167647930 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0584916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGAAC CTGCGACGAC GCGCGAGGGG CCGCTCCCGC CCCTGGCGAG AACCAATTTC 
AGTACGCGAA GCGTTCCGCC GGAAGAGCGG CACGACTACT ATCGCCGTGA GGTGCTGTCA
GCGCTGGACG CGCGCGATCC CGAGCCGGGC TTTTCGGCCA ACATCACCTC GCTGCGTCTG
GGGTCCCTGG CCTTCTACGT CACCGAGACC GGCGGCCACA CCATGTTCCG CACGCCGGAG
ATGATCGCCG CCGACGGTCG CGACCACTAC ATCGTGCAGT TCAACATCGC CGGCTCGCAT
ACCGGCGACT TCGACGGCGT GCCCTTTTCG GCCGGTCCGG GCGAGGTCGG CATCTGCGAC
CTGTCGCGGC CGATGCTGCT GCACAGCACG GCGGTCAAGG TGCTCTCGAC CTTCCTGCCG
CGCGCCGAGG TCAAGGCGGT GGCGCCCGAC ATCGAACTGC ACGGCATGGT GCTGGACGCC
AACCGCGCCG GCCTGCTGAT CGAGCACCTG GCCTCGGTCA CCCGGTGGTT CCCGCGACTG
CTGCCCGAGA CCCTGCCGGG CATCACCCGC GCCACCATCG AGCTGCTGGG CGCGTGCCTG
GTCATGGAAG CCAGCCGGGC GGACTTCGGC GTGCGCGAGT CGCCGGTGCT GATGCGGGCT
CGCGCCTATG TCGAGCACAA CCTGCTGGAG CCTACCCTCA ACCCGGCCAA GATCAGCGAA
GCGCTGGGCG TGTCGCGCTC GACGCTCTAC CGCCTGTTCG AACCGCTGGG CGGGGTGACG
GCCTATGTCT GGGACCGCCG CCTGCACCTG GCGCGCGCCG CCCTGCTGGA CCCCAAGCGA
GCCCGGCGGA TCAGCGAGAT CGCCTTCCAG TGCGGCTTCA GCAGCGAGGC CCATTTCAGC
CGCAGCTTCC GCAAGGCCTT CAACATCCGG CCCAGCGACC TGCGCTCGCT GCAGCCCAGC
CTGGCCGACG AGCCCGACAG CCCGTTCGCC AAGTGGACCG AGGCGGCGAA GGGGTAG
 
Protein sequence
MIEPATTREG PLPPLARTNF STRSVPPEER HDYYRREVLS ALDARDPEPG FSANITSLRL 
GSLAFYVTET GGHTMFRTPE MIAADGRDHY IVQFNIAGSH TGDFDGVPFS AGPGEVGICD
LSRPMLLHST AVKVLSTFLP RAEVKAVAPD IELHGMVLDA NRAGLLIEHL ASVTRWFPRL
LPETLPGITR ATIELLGACL VMEASRADFG VRESPVLMRA RAYVEHNLLE PTLNPAKISE
ALGVSRSTLY RLFEPLGGVT AYVWDRRLHL ARAALLDPKR ARRISEIAFQ CGFSSEAHFS
RSFRKAFNIR PSDLRSLQPS LADEPDSPFA KWTEAAKG