Gene Caul_5367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5367 
Symbol 
ID5897232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp77675 
End bp78931 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content71% 
IMG OID641550659 
Productconjugation TrbI family protein 
Protein accessionYP_001672145 
Protein GI167621637 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2948] Type IV secretory pathway, VirB10 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.196515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.456411 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCATC TCCCCCCAGA TCCGCCGCTT GATGGGCGGC CGTCTTACGA GCCCGAACGC 
AAGGCCAGCT CGGCGAGCGT GCTCAGCGCA CCCAGGTTCC CGGTCACCCG CTGGAACCGC
AAGTATCTGA TGGCCGGAGC CGCCGTCCTG GCCAGCATCG TGGCCGGCGG ATTCTACCTG
GGCTTTGGCG GGGCGCACGC CACCAAGGGC CGGCCCGACG ATTCGCAGAA CGCCGCTGAC
ACATCGAGCC CGCAGACGCC GGAGATCGCC ACCCGCTATG CTGCTGGCTA CGCCGATCCG
GCGGTGCGGC CGGGCACGAC CAGCTTGCCG CCGCCGGACG CGCTTGCGCC CCCTGCCCCG
ACGACGCAGG CCGCTGGCCA GCCCGGCCAA CCGGCGCCGG TTGATCCCGC CGTTCAGGCG
GCGCGCGAGC AGGCGCTGGC CGCTCGCTCG GCCAGCCCGT TCTTCGGCGG CGCCCAGGCT
CAGCCGCAGG CCGCCTCCCA AACTGGCCCT TTGGCCCCCG ATCCGGGGCC GATGCTGGCA
GCGGCCCTGG TGCCTGGCTT CGGCACGCCG CCCGCGTCGG CGGCCGGCGA CGTGCAGCCG
GCCAATGGCC AGGCCGGCAA GCGTCAGTTC GCGGCCGGCG CCAGGGTCGA TGACTATCTA
ACGAGCCCCC TGCAGGCGCC GATCAGTCCT TGGGAGGTCA AGGCCGGCAC GATCATCTCG
GCCGCCCTGA TCACGGCGAT CAATTCCGAT CTGCCGGGCC AGGTGATCGC CCAAGTCACC
GAGCCAGTGT ACGACCACAG GACCGGGCGC ACGGTGCTCA TCCCTCAGGG CTCGCGGCTG
ATCGGCCAAT ACGACAGCCA GGTCGCCCAC GGCCAAAGCC GCTCGCTGAT CGCCTGGAAC
CGGGTGATCA TGCCCGACGG CCGTTCGATC AACATCGGCT CGATGGCCGG CGCCGATCTC
TCCGGCGCGG CCGGGCTGCA GGACAAGACC GATGGTCACT TCTGGCAACT GGCTCGCGGC
GTGGCGCTCT CGACGGTGTT CTCCGTCGGC GCGGCGGCGG CGCAAGACGC CGGAACCCGC
AGCTCCGGCG GTCTTGTGAT CAACAGCGCC GGCAGCGGGA TTTCCACTTC CGCCCAGCAG
GTCGGCCAGC AGGTCACCGC TCGCGACCTC AACCGGCAGG CCACCTTGCG GATCCGGGCC
GGGTGGCCGC TCCGGGTCAT CGTCAACAAA GACATGATCC TGGCCCCCTA CCCCTAA
 
Protein sequence
MRHLPPDPPL DGRPSYEPER KASSASVLSA PRFPVTRWNR KYLMAGAAVL ASIVAGGFYL 
GFGGAHATKG RPDDSQNAAD TSSPQTPEIA TRYAAGYADP AVRPGTTSLP PPDALAPPAP
TTQAAGQPGQ PAPVDPAVQA AREQALAARS ASPFFGGAQA QPQAASQTGP LAPDPGPMLA
AALVPGFGTP PASAAGDVQP ANGQAGKRQF AAGARVDDYL TSPLQAPISP WEVKAGTIIS
AALITAINSD LPGQVIAQVT EPVYDHRTGR TVLIPQGSRL IGQYDSQVAH GQSRSLIAWN
RVIMPDGRSI NIGSMAGADL SGAAGLQDKT DGHFWQLARG VALSTVFSVG AAAAQDAGTR
SSGGLVINSA GSGISTSAQQ VGQQVTARDL NRQATLRIRA GWPLRVIVNK DMILAPYP