Gene Caul_3410 details


Gene Information

Locus tag: Caul_3410
Symbol:
ID: 5900865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3682657
End bp: 3683646
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 70%
IMG OID: 641563916
Product: ArsR family transcriptional regulator
Protein accession: YP_001685035
Protein GI: 167647372
COG category: [H] Coenzyme transport and metabolism; [K] Transcription
COG ID: [COG0640] Predicted transcriptional regulators; [COG2226] Methylase involved in ubiquinone/menaquinone biosynthesis
TIGRFAM ID:
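
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that consistency check. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are inferred from the reported values and are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record
print(gene_length(3682657, 3683646))  # 990
print(protein_length(990))            # 329
```

Under these assumptions, the reported gene length (990 bp) and protein length (329 aa) agree with the coordinates.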


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0187563
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0416707
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTAT CGTCCGAACA GGTTGTGGAT CTGCTGCGCG CGGCCGGGGA ATCGACCCGC 
CTGCGGGTGC TGGCCCTGTT GGCGATCGAG GAACTGTCGG TGCTGGAGCT GTGCCGCATC
CTCGACCAGA GCCAGCCGCG TGTGTCGCGA CACCTGAAGC TGCTGGCCGA GGCCGGACTG
GTCGAGCGTT TCCCTGACGG AGCGTGGGTC TTCTATCGCC TGGCGCTGAA GTCGCCGGGC
CGGGGCGTGA TCGACCGGGC GTTGACGTTG ATCGATCCCG AGGACTCGGC CGCCCTGGCC
GACGCCGAGA AGCTGACCCT GGTCCGTGCC GAGCGAGCCG CCGGCGCCCA GGCCTATTTC
GCCCGCAACG CCGCGCGCTG GAACGAGATC CGTTCGCTGT ATGTCGACGA GGCCGAGGTC
GAGGCCGCCA TCCTGCGGGC GGCGGGCGAG GGGCCCTTCG ACGAAATGGT CGACCTGGGC
GCGGGCGCGG GACGCATGCT GACGCTTCTT GGTCGCCGCG CGGGCGCGGC GTTGGGGCTC
GATCTGTCGC AGCAGATGCT CAACATCGCC CGCGATGAGG TGGCCAAGGC GGGCCTGGCT
CAATGCGAGC TGCGCCACGG CGACATCTTC GCCACCGGCC TGCCGGGCGG CTGCGCCGAC
CTGGTGACCG TGCACCAGGT GCTGCACTAT CTGGGCGATC CCGCCGCCGC CGTGGCCGAG
GCCGCGCGGC TGGTGGCCGA TGGCGGCCTG CTGCTGATCG CCGACTTCGC CCCGCACGAC
CACGAGTTCC TGCGCGAGAA CCACCAGCAC CGCCGCCTGG GCTTCGCCGA CGCCGAGATC
ATTCCCTGGA TCGAGGCCGC CGGCCTTGTC CTGGACAGCA ACATCGCCCT GCCGCCGACC
TCGGACGAAG GCCTGACCGT CAAGATCTGG ACGGCCCGAC GCCCAAGCGA TCTGGCGGCC
GAAAGAAACG CCGAAAGAAA CGCCGCATGA
 
Protein sequence
MKLSSEQVVD LLRAAGESTR LRVLALLAIE ELSVLELCRI LDQSQPRVSR HLKLLAEAGL 
VERFPDGAWV FYRLALKSPG RGVIDRALTL IDPEDSAALA DAEKLTLVRA ERAAGAQAYF
ARNAARWNEI RSLYVDEAEV EAAILRAAGE GPFDEMVDLG AGAGRMLTLL GRRAGAALGL
DLSQQMLNIA RDEVAKAGLA QCELRHGDIF ATGLPGGCAD LVTVHQVLHY LGDPAAAVAE
AARLVADGGL LLIADFAPHD HEFLRENHQH RRLGFADAEI IPWIEAAGLV LDSNIALPPT
SDEGLTVKIW TARRPSDLAA ERNAERNAA
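
The gene and protein sequences can be cross-checked against each other, and the GC content recomputed, with a short script. This is a hedged sketch, not the database's own method: it hard-codes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in its permitted start codons), and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above
print(translate("ATGAAATTAT CGTCCGAACA GGTTGTGGAT"))  # MKLSSEQVVD
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing spaces) gives a quick integrity check; `gc_content` on the same input can be compared with the reported 70%.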