Gene Caul_4427 details


Gene Information

Locus tag: Caul_4427
Symbol: tolB
ID: 5901888
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4796296
End bp: 4797621
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 68%
IMG OID: 641564945
Product: translocation protein TolB
Protein accession: YP_001686045
Protein GI: 167648382
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0823] Periplasmic component of the Tol biopolymer transport system
TIGRFAM ID: [TIGR02800] tol-pal system beta propeller repeat protein TolB
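
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene coordinates include the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length_bp(4796296, 4797621)
    print(length, protein_length_aa(length))  # prints "1326 441"
```

Both results agree with the Gene Length and Protein Length fields of this record.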


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.500534
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCTGA AGTCCGTCTT CACCGGCCCC CTGGCGGCTT GGGCGACAGC CGCCGTCCTG 
GCGCTGTCCA TGGCCGCCCT CGCGCCGACC GTCGCCCGCG CCCAGATCGA GGTCGATATC
GACAAGGGCG CGGTCAAGCC GCTGCCGGTC GCCATTCCAG CCTTTTCGGG CGGCGGCCGC
GGCGCCGACA TCGCCCAGGT CATCAGCGGC AATCTCGAGC GCTCGGGGCT GTTCCAGCCG
CTGAACGTGG CCAATGTCGC CGACAAGCTG GCCGACGTGA ACGTCCAGCC GCGCTTCCCC
GACTGGCAGG CCACCGGGGC CCAGGCCCTG ATCAACGGCC AGGTGACGGT CGGCGCCGAC
GGCGTGCTGC GCGTCGACTT CCGCCTGTGG GACACCTTCA GCCAGCAACA GCTTCTGGGC
CTGCAATTCA CCTCGACCGC CGAGAACTGG CGGCGGGTCG CCCACAAGAT CAGCGACGCG
GTCTACGAGC GGCTGACCGG CGAGAAGGGC TATTTCGACA CCCGCGTGGT GTTCGTCGCC
GAGAGCGGCG GCAAGCTGAC GCGCGTCAAG CGTCTGGCGA TCATGGACCA GGACGGCGCC
AACCCGCAGT ACCTGACCGA CGGCTCCTAC ATCGTCATGA CCCCGCGCTT CTCCTCGACC
AGCCAGGAGA TTACCTACAT GGCGCTGCGG CCCACCGGGT CGAGCATCTA CCTGACCAAC
CTGGAGACGG CTCGCACCGA GACCATCGGC AAGTTCCCGG GCATGGTCTT CGCCCCGCGC
TTCTCGCCGG ACGGCGGCAA GGTGGCCTTC TCGGTCGAGA AGGGCGGCAA CAGCGACATC
TACGTGATGG ACCTGCGCAG CCGCCAGTCG ACGCGGATCA CCACCGACCC GGCCATCGAC
ACTTCGCCGT CGTTCTCGCC GGACGGATCG AAGATCGTCT TCAACTCCGA CCGCGGCGGC
CAGGCCCAGC TCTACATCAT GAACGCCGAC GGCAGCGGCG TGCGCCGCAT CTCGTACGGC
GGCGGCCGCT ACACCACGCC GGTGTGGAGC CCGCGCGGCG ACTTCATCGC CTTCACCAAG
CAGACCGGCG GCGAATTCCA CATCGGGGTC ATGAAGGTCG ATGGCGGCGA CGAGCGGCTG
CTGACCACCA GCTATCTCGA CGAAGGCCCG ACCTGGGCGC CCAACGGCCG GGTGCTGATG
TTCTCGCGCG AGGGCTCCAG CGGCAATTCG CGGCTCTGGA CGGTGGACAT CACCGGCCGG
ATCCTGCGCC CCGCCGCCTA TACGGGCGCG GCGTCAGACC CCGCCTGGTC GCCGCTTCTG
GATTGA
 
Protein sequence
MNLKSVFTGP LAAWATAAVL ALSMAALAPT VARAQIEVDI DKGAVKPLPV AIPAFSGGGR 
GADIAQVISG NLERSGLFQP LNVANVADKL ADVNVQPRFP DWQATGAQAL INGQVTVGAD
GVLRVDFRLW DTFSQQQLLG LQFTSTAENW RRVAHKISDA VYERLTGEKG YFDTRVVFVA
ESGGKLTRVK RLAIMDQDGA NPQYLTDGSY IVMTPRFSST SQEITYMALR PTGSSIYLTN
LETARTETIG KFPGMVFAPR FSPDGGKVAF SVEKGGNSDI YVMDLRSRQS TRITTDPAID
TSPSFSPDGS KIVFNSDRGG QAQLYIMNAD GSGVRRISYG GGRYTTPVWS PRGDFIAFTK
QTGGEFHIGV MKVDGGDERL LTTSYLDEGP TWAPNGRVLM FSREGSSGNS RLWTVDITGR
ILRPAAYTGA ASDPAWSPLL D
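
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used by the source database: it assumes the sequence is given on the coding strand, and it uses the standard codon assignments, which translation table 11 shares with table 1 for elongation (the two tables differ in permitted start codons). The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; '*' marks a stop codon.
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored, so sequences printed in blocks of ten bases
    (as in this record) can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the gene sequence in this record.
    print(translate("ATGAACCTGA AGTCCGTCTT C"))  # prints "MNLKSVF"
```

Applied to the opening bases of the gene sequence, the routine reproduces the first residues of the protein sequence (MNLKSVF); the final codons CTT CTG GAT TGA likewise give the terminal residues LLD followed by a stop.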