Gene Caul_2035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2035 
Symbol 
ID5899490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2178362 
End bp2179561 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content68% 
IMG OID641562524 
Productconjugation TrbI family protein 
Protein accessionYP_001683661 
Protein GI167645998 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2948] Type IV secretory pathway, VirB10 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGC CGCGTGAAGA GGGTGTGGCA GCGGCTTCTG AGAGCGACAA GGCGCCGCCG 
GAGACCCTGA CCCTGCGGGC CAAGCCCCGG TCGGTGGTGA AGTTCCGCCG TGGGCTGGTG
ATCGGCACGG CTGCTGTGGG CGCCCTGGCG CTGGCGGGCT TCGGATGGAT GGCGCTGAGC
GCCAAGGCGG TTCACCTCGG TAGACCCGCG GACGATCCGT CCGTCGAAGA TCGAGCCTCA
TCCCGCGAGG CGGTCGCCGC CTTGCCGAAG GACTACACGG CGCCCAGGCT CGGTCCGCCC
TTGCCGGGCG ATCTGGGACG CGCCGTGGTC GACCAGCAGC GCCGGGATCG CGTCGACGTC
TCGCCGCCGG TCAGTTCGAC GTCCACGCCG GCAGATCAGG CTGCCGAGGC GGAACGTCAG
CGTCTAGCGG CCCAGGCCCA TCTGGCGCGG GAGGCCGGCG TCATGGTGCA AGCGACTGCG
CGGGGAGCAG GGGAGACGGC GAGCGCCGGC GCTGTCGCCG CCACGCCGGT CACGGCGCCC
TCATCCCTTT CCGGGGAGGC GCACGGCAAG CAAGCCTTTG TCGAGAAAGC CGGCCCGGCC
GAGATCCATA ACGCCCACCA ATTGGAGGCC CCGCGTTCGC CCTATCAGCT CATGGCCGGG
AGCATCATCG CCGCCAGCTT GATCACCGGC CTCGACTCCG ACCTTCCGGG CCAAGTCATC
GCCCAGGTCA CCGAGCCGGT CTACGACACC GCCACAGGCG CATATCTGCT CATCCCACAG
GGGGCGCGGT TGATTGGCGT CTATGACAGT GTCGTGGCTT TCGGCCAGAC CCGAGCGCTG
CTGGTCTGGC AGCGTCTGAT CCTCCCCGAT GGATACTCGA TCCAGCTGGA CAACCTGCCG
GCTACCGACG CGGCCGGTTA TGCGGGCCTA GCCGACAAGG TCGATTTTCA TACCTGGCAG
CTTCTCAAGG GCGTGGGCCT ATCGACCCTC CTGGGCGTGG GCACGGAAAT CAGCTTCGGC
GACGATGAGA GTGATCTGGT CCGCGCCATC CGCCAGTCGA CCCAGCAGAG CGCCTCCCAG
GCGGGGCAGC AGGTCGTGTC CAAGCAGCTC GATGTGCAGC CAACCCTCCG AGTTCGGCCA
GGCTGGCCTC TGCGCGTCAT CGTCCACAAG GACCTAACCC TGCGCCCCTG GCGCGCTTGA
 
Protein sequence
MSTPREEGVA AASESDKAPP ETLTLRAKPR SVVKFRRGLV IGTAAVGALA LAGFGWMALS 
AKAVHLGRPA DDPSVEDRAS SREAVAALPK DYTAPRLGPP LPGDLGRAVV DQQRRDRVDV
SPPVSSTSTP ADQAAEAERQ RLAAQAHLAR EAGVMVQATA RGAGETASAG AVAATPVTAP
SSLSGEAHGK QAFVEKAGPA EIHNAHQLEA PRSPYQLMAG SIIAASLITG LDSDLPGQVI
AQVTEPVYDT ATGAYLLIPQ GARLIGVYDS VVAFGQTRAL LVWQRLILPD GYSIQLDNLP
ATDAAGYAGL ADKVDFHTWQ LLKGVGLSTL LGVGTEISFG DDESDLVRAI RQSTQQSASQ
AGQQVVSKQL DVQPTLRVRP GWPLRVIVHK DLTLRPWRA