Gene Plav_2122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2122 
Symbol 
ID5455208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2302396 
End bp2303736 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content61% 
IMG OID640877699 
Productbeta-propeller repeat-containing to-pal system protein TolB 
Protein accessionYP_001413393 
Protein GI154252569 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0823] Periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID[TIGR02800] tol-pal system beta propeller repeat protein TolB 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.256883 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.227077 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGATG TTCAGACGGT CCGGCGCGGA AACGCGGTTC AGAGTCTCAT GTCGAAGCTC 
ATCCTGCCAT TGGTCATGGC GGTGGCTTTC GCTTTGCCGG CGCGGGCGGC ACTGCAGATC
GACATTACCC AGGGCAACGT CGATCCGTTG CCGATTGCCA TCACCGATTT TGTCGGTGAA
GGGTCTGTCG GCGCCGACAT GTCGGCTGTC ATAAGCAACA ATCTGGAGCG CTCGGGACTC
TTCCGGCCCT TGCCGAAAGC GTCTTTCATC GAAAAAGTTT CCGACATCAA TGTCCAGCCG
CGCTTCGGCG ACTGGCGCGT CATCAATTCC CAGGCGCTGG TGACGGGGCA GACGCGAATG
GAAGCCGATG GCCGTCTCCG CGTGGAGTTC AGGCTTTGGG ATGTGCTCGG CGAGCAGCAG
CTTACCGGTC TACAGTTTTT CACCACGCCC GATAACTGGC GCCGCGTCGC ACATCTTATT
TCCGATGCGA TCTACAAGCG GCTCACGGGC GAGGATGGAT ATTTTGACAC GCGTATTGTC
TATGTATCCG AAACGGGACC TAAAAACGCG CGTGTGAAGC GCCTCACGAT CATGGATCAG
GATGGGCACA ACCCGCGCAT GCTGACGCGC GGCAACGAGC TTGTGCTGAC CCCCCGCTTC
AGCCCCAACT CCCAGGAAAT CACCTATCTC GCCTACCGGA ACAACCAGCC GCGAGTTTAC
GTCCTCGACA TCGAGACGGG GCAGCAGGAA GTGGTAGGCG AATTTCCCGG CATGACCTTC
GCGCCGCGTT TTTCGCCGGA CGGACAGCGC ATCATCATGA GCCTTCAACG CGGCGGGAAT
TCCGACATCT ACACCATGGA TCTCCGGAGC CGCCAGGTTG TGCGTCTCAC CAATACGGCG
GCCATCGACA CGGCGCCGAG TTATTCGCCG GATGGCCGGC AGATTACCTT TGAATCGGAC
CGAGGCGGTT CGCAGCAGAT TTACGTCATG GATGCGAGTG GCTCGAATCA GCGCCGTATC
AGCTTCGGGC AGGGCAGCTA TGCCACGCCC GTCTGGTCGC CGCGCGGCGA TCTCATCGCC
TTCACCAAGA TAACAGGCGG CCGGTTCGTG ATTGGCGTCA TGCGGCCCGA CGGTACCGGC
GAACGCGTGC TGACGGATGG CTTCCACAAT GAGGGGCCGA CATGGGCTCC CAATGGCCGT
GTTCTGATGT TTTTCCGTGA AACGCGGGGC GCACAGGGCG GGCCAAGCCT CTGGTCGGTC
GATGTTACCG GATATAACGA GCGCCCGTCG CCAACGCCGA CGTTTGCCTC GGACCCGGCC
TGGTCGCCGC GGATACAGTA G
 
Protein sequence
MMDVQTVRRG NAVQSLMSKL ILPLVMAVAF ALPARAALQI DITQGNVDPL PIAITDFVGE 
GSVGADMSAV ISNNLERSGL FRPLPKASFI EKVSDINVQP RFGDWRVINS QALVTGQTRM
EADGRLRVEF RLWDVLGEQQ LTGLQFFTTP DNWRRVAHLI SDAIYKRLTG EDGYFDTRIV
YVSETGPKNA RVKRLTIMDQ DGHNPRMLTR GNELVLTPRF SPNSQEITYL AYRNNQPRVY
VLDIETGQQE VVGEFPGMTF APRFSPDGQR IIMSLQRGGN SDIYTMDLRS RQVVRLTNTA
AIDTAPSYSP DGRQITFESD RGGSQQIYVM DASGSNQRRI SFGQGSYATP VWSPRGDLIA
FTKITGGRFV IGVMRPDGTG ERVLTDGFHN EGPTWAPNGR VLMFFRETRG AQGGPSLWSV
DVTGYNERPS PTPTFASDPA WSPRIQ