Gene Caul_3824 details

Gene Information

Locus tag: Caul_3824
Symbol:
ID: 5901286
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4142649
End bp: 4144388
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 69%
IMG OID: 641564346
Product: hypothetical protein
Protein accession: YP_001685448
Protein GI: 167647785
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3843] Type IV secretory pathway, VirD2 components (relaxase)
TIGRFAM ID:
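
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4142649
end_bp = 4144388
gene_length_bp = 1740
protein_length_aa = 579

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp, total_codons, coding_codons)  # 1740 580 579
```

Under these assumptions the span matches the listed gene length (1740 bp), and 580 codons minus one stop codon gives the listed protein length (579 aa).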


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGCG AGGACGATTT CCGCATCCGG CCCGGCCGCA TCCGCTCGAC CAGCGCGCAA 
CGGGCGCGGC CCTTTATCGC CCAGGCGCTT GCCGCCGCCC AACGCGCGGG CGGCAGCATT
TCCCGACAAG GAAAGATCGG ACCTGGCGCG CGCTCGCGTT TTGGCGCCGG CCAGCGCGCC
TCGATCCAGG CGAACCGCCT GATCACCGCT CGCTCGCGCG GCGCGGTCAT CAAGGCGCGC
GTCGTTCGCC ATAGTGGCCG CTCCGCGCCG CTCGGCACCC ATCTCAATTA TCTCCAGCGC
GACGGCGTCA CCCGCGACGG AGAGAAAGCC CGGCTGTTCG GCTCCGGTGC CGACGACATC
GACGCCAAGG CCTTCGCCGA ACGCGCCGAG GACGACCGCC ACCACTTCCG CTTCATCGTC
AGTCCGGACG ACGCGACGGA GATGTCCGAT CTCAAGGGCT TTACCCGCGA GTTGGTCGGC
CAGATGGAGA AGGATCTCGG CACGCGCCTC GAATGGGTCG CGGTCGATCA CTGGAACACC
GAGCACCCGC ATGTCCATCT GATCGTCCGC GGTGTCCGCG ACGACGGCGA GAACCTGGTC
ATCTCGCGCG ACTACATCAA GGAGGGAATG CGCGACCGGG CGCGCGAGCT GATCACCCAG
GAGTTGGGGC CGCGCACCGA TCAGGAGATC CGCCGCACGA TCGAACGCCA GATCGACGCC
GACCGCTGGA CCAATCTCGA TCGCCAGCTT GCCCGTGACA GCTATCGCAC CGGCGTCATC
GACCTTGCGC CGCGCGCCGA TCGCCAGCCC GACGAATTCC ATGCGCTGAA AGTCGGCCGA
CTTCGCAAGC TCGAAGGGCT CGGCCTCGCC GACGAGATTG GCCCCGGCCA GTGGACTATC
TCCGAAAAAG CCGAGGCGAC GATGCGCGAA CTCGGCGAAC GCGGCGACAT CATCAAGCGC
ATTCATCATG GCCTGACCGA GCGCGGCATC GAGCGTGGCG CTTCGAGCTA TGTACTCGCG
GCCGAGAGCC TCGACGAACC TGTCGTCGGC CGCCTGGTCC AGCGCGGTCT CGACGACGAG
CTGAAGGGCA CGGCCTATGC CGTTGTCGAC GGCATCGATG GGCGCACGCA CCACGTCAAG
CTGCCGGACT TGGACGCCGC CGGCGACAGC GCGCCGGGCT CTATTGTCGA GCTGCGAAAG
TTCGATGACG CCCAGGGGCG CAGGCGCCTC GCGCTGGCGG TCCGATCCGA TCTCGACATC
GAGCGCCAGG TCTCCGCGAC CGGGGCAACG TGGCTCGATC GGCAGGCCAT CGCCCGCGAG
CCGGTGGCGC TTGGCGGCGG TGGCTTCGGC GCCGAGGTGC GCGACGCCAT GGACCGGCGC
GCCGAACATC TTGTCGGCCA GGGCCTCGCC GAGCGGCAAT CGCGCGGCGT CAGTTTCTCG
CGCGGCCTGA TCGACACGCT GCGCCGGCGC GAGCTGGACG CGGCGGCGGC TCGCCTGTCG
ACCGAGACAG GTCGGCCGCT GACGAAACAG AACGAGGGCG AATACGTCAC CGGGGCATAC
CAGCGCCGCC TGACGCTCGC TTCCGGCCGC TTCGCGATGA TCGATGACGG CCTCGGCTTC
AGCCTCGTGC CGTGGTCGCC ATCCCTCGAA AAGCAGCTCG GGAAGCATGT CTCCGGTATC
TCGCGCGCCG ACGGCGGCGT CGACTGGAGC TTTGGCCGAA ACAGGGGGCT CGGCCGATGA
 
Protein sequence
MSGEDDFRIR PGRIRSTSAQ RARPFIAQAL AAAQRAGGSI SRQGKIGPGA RSRFGAGQRA 
SIQANRLITA RSRGAVIKAR VVRHSGRSAP LGTHLNYLQR DGVTRDGEKA RLFGSGADDI
DAKAFAERAE DDRHHFRFIV SPDDATEMSD LKGFTRELVG QMEKDLGTRL EWVAVDHWNT
EHPHVHLIVR GVRDDGENLV ISRDYIKEGM RDRARELITQ ELGPRTDQEI RRTIERQIDA
DRWTNLDRQL ARDSYRTGVI DLAPRADRQP DEFHALKVGR LRKLEGLGLA DEIGPGQWTI
SEKAEATMRE LGERGDIIKR IHHGLTERGI ERGASSYVLA AESLDEPVVG RLVQRGLDDE
LKGTAYAVVD GIDGRTHHVK LPDLDAAGDS APGSIVELRK FDDAQGRRRL ALAVRSDLDI
ERQVSATGAT WLDRQAIARE PVALGGGGFG AEVRDAMDRR AEHLVGQGLA ERQSRGVSFS
RGLIDTLRRR ELDAAAARLS TETGRPLTKQ NEGEYVTGAY QRRLTLASGR FAMIDDGLGF
SLVPWSPSLE KQLGKHVSGI SRADGGVDWS FGRNRGLGR
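
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two are understood to differ in which start codons are allowed, not in these assignments). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and introduced only for this example.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: amino-acid assignments of table 11 equal those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCGGCG AGGACGATTT CCGCATCCGG CCCGGCCGCA TCCGCTCGAC CAGCGCGCAA"
print(translate(first_line))  # MSGEDDFRIRPGRIRSTSAQ
```

The 20 residues printed for the first 60 bases agree with the start of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.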