Gene Caul_4073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4073 
Symbol 
ID5901535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4412209 
End bp4413726 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content74% 
IMG OID641564594 
ProductAraC family transcriptional regulator 
Protein accessionYP_001685696 
Protein GI167648033 
COG category[F] Nucleotide transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[COG2169] Adenosine deaminase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGAC CCATGGATCT CGACCCCGAC GTTTGCTACC GCGCCATCCA GACCCGCGAC 
GCCCGCTTCG ACGGCCGGCT GTTCGTGGCG GTCCGGACGA CGGGGATCTA TTGCCGGCCG
GTCTGCCCGG CCCGCACGCC GCTGCGCCAG AACGTCACCT TCCACGCCAC CGCCGCCTCG
GCCGAGGCGG CGGGCTACCG CGCCTGCCTG CGCTGCCGGC CCGAGACCTC GCCGCAACTG
GGGGCCTGGA ACGGCGCGTC CAACACCGTC TCTCGCGCCC TGGCCCTGAT CGAGGCCGGC
GCTCTGGATG GCGGCGACGT GGAAGGCCTG GCCGAGCGCG TCGGGGTCGG CGGGCGGCAA
CTGCGGCGCC TGTTCCTGCG GCACCTGGGC GCGACGCCGG TCGGGGTGGC CCAAACCCGG
CGGGTGCTGC TGGCCAAGCA GCTGATCCAC GAGACCGACC TGCCGATGGG CGAGGTGGCC
CTGGCCGCCG GCTTCGGCAG CGTGCGACGC TTCAACGAGA CCTTCCAGCA GCTCTATGAT
CGGCCGCCCG CCGCGCTGCG CCGTCGCAAG TCCGCCTCTC CCGTTGGTGA GCCGCCGCCC
GGCGAGGCCG TCGCCCTGAC CCTGCGCTAC CGTCCGCCCT ACGACTGGGA CGCCATGCTG
GCCTTCCTGG CCCTGCGCGC CATTCCCGGC GTCGAGGTGA TCGAGAGCAA TACCTACCGC
CGGGTGATCG CCCTGGACGG CGCGGCCGGG ACCATCGCCG TCAGTCCGAT CGACGGCGAC
CGGCTGAGCG TGGCGGTGCG CTTTCCCAAG CTTTCGGCCC TGCCCCGCAT CCTGGCCCGC
GTGCGGGGGG TGTTCGACCT GTCGGCCGAC CCGGTCGGGA TCGCGGCGGT GCTGTCGCGC
GATCCGGACC TGGCGCGGAT GGTCGGCCTG CGTCCCGGCC TGCGCGTGCC CGGGGCCTGG
GACGGGTTCG AGCTGGCGGT GCGGGCGATC CTGGGCCAGC AGATCACCGT CGTTCAGGCC
CGCAAGCTGG CCGGCGACCT GGTCGCGGCG CACGGCGAAC CGCTGGCGCA GCCCTGGACC
GAGCCCGGCC TGACCCACGC CTTCCCGTCG GCCGAGCGCC TGGCCGCCAC CAATCTCTCA
GGCATGAAGA TGCCCGGGGC CCGCATTCGC TGCCTGTCGG CCATGGCCCA GGCCATCGCC
GACGCCCCCA ACCTGCTGTC GCCGACCGCC GGCCTGGACG AGATGGTTCG GCGGCTGCGC
GCCCTGCCGG GTATCGGCGA ATGGACGGCG CAGTACATCG CCATGCGCCA GCTACGCGAA
CCTGACGCCT TCCCCGCCGC CGACGTCGCC CTGATGCGCG CCCTCGCGGA CGTCGACGGC
GTTCGTCCGA CAGCGGAGCA ACTTCTGACC CGCGCCGAGG CCTGGCGACC GTGGCGGGCC
TACGCCGCCC TGCACCTGTG GGCCTCGCTG GCGGATGAAG GCGCGCCGCC CGTTCGGAAG
GTGAAGCGTG CGGCCTGA
 
Protein sequence
MIGPMDLDPD VCYRAIQTRD ARFDGRLFVA VRTTGIYCRP VCPARTPLRQ NVTFHATAAS 
AEAAGYRACL RCRPETSPQL GAWNGASNTV SRALALIEAG ALDGGDVEGL AERVGVGGRQ
LRRLFLRHLG ATPVGVAQTR RVLLAKQLIH ETDLPMGEVA LAAGFGSVRR FNETFQQLYD
RPPAALRRRK SASPVGEPPP GEAVALTLRY RPPYDWDAML AFLALRAIPG VEVIESNTYR
RVIALDGAAG TIAVSPIDGD RLSVAVRFPK LSALPRILAR VRGVFDLSAD PVGIAAVLSR
DPDLARMVGL RPGLRVPGAW DGFELAVRAI LGQQITVVQA RKLAGDLVAA HGEPLAQPWT
EPGLTHAFPS AERLAATNLS GMKMPGARIR CLSAMAQAIA DAPNLLSPTA GLDEMVRRLR
ALPGIGEWTA QYIAMRQLRE PDAFPAADVA LMRALADVDG VRPTAEQLLT RAEAWRPWRA
YAALHLWASL ADEGAPPVRK VKRAA