Gene Smed_2627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2627 
SymboltolB 
ID5323496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2727533 
End bp2728843 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content62% 
IMG OID640791571 
Producttranslocation protein TolB 
Protein accessionYP_001328292 
Protein GI150397825 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0823] Periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID[TIGR02800] tol-pal system beta propeller repeat protein TolB 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATGC TGAGACGCAA TTTTTTCCGC CTTCTGATGG TGCTGGTCGC AGGCTGCGGG 
CTCATTGCCT CGCCGGCAAA GGCGCTCGTC GAGATCGACA TCAACAAAGG TAACGTCGAG
CCGCTGCCGA TCGCGATCAC GGATTTCGTG CAGGGCGAGC TTGCGCAGAA GATATCCGAC
GTCATTGCCG CCGATCTGAA GCGCTCCGGG CTTTTCGCTC CCATCAACAA GGGCGCGTTC
ATCGAGAAGG TCTCCAATCC CGATGCCACT CCGCGCTTCG AGGACTGGAA GGTCATCAAC
GCGCAGGCGC TCGTCATTGG TCGCGTCACA AAAGAAGGCG ACGGCAGGCT GAAGGCGGAG
TTCCGCCTCT GGGATACCTT CGCCGGAACG CAGATGCTGG GTCAGCAGTT CTACACCCAG
CCGGAAAACT GGCGCCGGGT CGCCCACATC ATCGCCGATG CGATCTATGA AAGGATCACG
GGCGAGAAGG GCTATTTTGA CACGCGCATC GTCTATGTCG CCGAAAGTGG TCCGAAAAAT
GCGCGCCAGC GCCAGCTGGC CATCATGGAC CAGGACGGGG CCAATTCCCG CGCGCTCACC
AATTCCAATG ACATCGTGTT GACGCCGCGC TTCTCGCCGA ACCGCCAGGA AATCACCTAT
ATGTCGTTCG AGAACCAGCA GCCACGGGTC TATCTGCTGC AGCTGGAAAC GGGGCAGCGC
GAGGTGGTCG GCAACTTCCC GGGCATGACC TTCGCTCCAC GCTTTTCGCC GGACGGCCAG
CGGGTGATCA TGAGCCTGCA GCAGGAAGGC AACGCCAATA TCTATACGAT GGACCTGCGC
TCGCGCACGA CGACGCGGCT CACCAACACC GCGGCGATCG ACACCTCGCC GTCCTATTCG
CCGGACGGAA GCCGGGTCGT TTTCGAAAGT GATCGCGGCG GCAGGCAGCA GCTCTATGTC
ATGGGTGCCG ATGGCTCGGG CCAGACGCGC ATCTCCTTCG GCGACGGTTC CTATTCGACG
CCGGTCTGGT CCCCGCGCGG CGATCTCATC GCCTTCACCA AGCAGTCGGG TGGGAAGTTC
TCGATCGGTG TCATGAAACC GGACGGCTCG GGTGAGCGTA TCCTCACGAC AGGCTTCCAT
AATGAAGGTC CCACCTGGGC GCCGAACGGC CGCGTGCTGA TGTTCTTCCG CCAGAACGCC
GGCGCAGGCG GCCCACAGCT CTATTCGATC GACCTGACGG GCTATAACGA GCAGCTTGTC
CCGACCCAGG GCTTCGCCTC GGACCCGGCC TGGTCGCCGC TCATGGAGTA G
 
Protein sequence
MEMLRRNFFR LLMVLVAGCG LIASPAKALV EIDINKGNVE PLPIAITDFV QGELAQKISD 
VIAADLKRSG LFAPINKGAF IEKVSNPDAT PRFEDWKVIN AQALVIGRVT KEGDGRLKAE
FRLWDTFAGT QMLGQQFYTQ PENWRRVAHI IADAIYERIT GEKGYFDTRI VYVAESGPKN
ARQRQLAIMD QDGANSRALT NSNDIVLTPR FSPNRQEITY MSFENQQPRV YLLQLETGQR
EVVGNFPGMT FAPRFSPDGQ RVIMSLQQEG NANIYTMDLR SRTTTRLTNT AAIDTSPSYS
PDGSRVVFES DRGGRQQLYV MGADGSGQTR ISFGDGSYST PVWSPRGDLI AFTKQSGGKF
SIGVMKPDGS GERILTTGFH NEGPTWAPNG RVLMFFRQNA GAGGPQLYSI DLTGYNEQLV
PTQGFASDPA WSPLME