Gene Plav_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2020 
Symbol 
ID5457047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2208186 
End bp2209535 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content63% 
IMG OID640877597 
Productmajor facilitator transporter 
Protein accessionYP_001413291 
Protein GI154252467 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.34408 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGAAG ACGAAGATAT CAGAACGGCG GAAACCGCTG CACCGGCCGC CAACCGGCCT 
CTGCGTCCCT CCGATCCCGA CAGGCAGACT ACGCCGCAGG AGCGCAAGCG CGCCTTCTTC
ATCCTCTTCG TCTGCCTGCT CTCGGTCGGC ATGGGCCAGA CACTCGTCTT CGCCGTCCTG
CCCCCCATCG CCGCCGATCT CGGCATGTCC GAGTTCGAAA CGACGATGAT CTTTTCACTC
TCCGCCGCGC TCTGGGTGCT CACCAGCACC TTCTGGGGCC GCCGCAGCGA CGTCATGGGC
CGCAAGCCAG TCATCCTGAT TGGCGTTTTC GGTTTCGCTT TCTCTACTTC GCTTGTCGGC
TTTGTGCTGC TGGCCGGCTA TCAGCACTGG ATTTCGATGG TGTTGATGTT CCCGCTGGTC
ATGGGTGCGC GGGCGATCTT CGGTATTTTC GGTTCCGGCT CAATGCCCGC CTCGCAGGCC
TATGTCGCAG ACCGCACCAC GCGGGCCGAG CGGGCAGGAA GCATCGCGCA AATCGGCGCC
GCCTTCGGTC TCGGAACGGT CGTCGGCCCC GGCATCGCCG CAGCCTTCGC GGAAATCCAC
ATCCTCGCAC CTTTCTTCGC TGTCGGTGCG CTGGCCTTTG TCAGCGGCAT AGCGATCTGG
ATCCTGTTGC CGGAGCGTAC GCGGCCAAAG AAGATGCTTT CGGAGAAACA GGAAAAGCAG
GCCGCGTTGA AGTGGAGCGC GCCCAATGTC ATGCCCTGGC TCGTCATCGG CGTGATCCTC
AGCCTCAGCC AGTCGATCAT GATGCAGCTC TTCGCCTTCT ATCTGATGGC GGAGCTCGGC
ATGCAGGGGT CGGAAGCCAC GCAGCTTGTC AGCGTCGGCA TGATGGCCAT GGCGATGGCG
ACCCTCGTCG CGCAGCTCGG CCTCATTCAG CGGTTCGACC TCTCGGTGCA ATTCCTGCTG
CGCTGGGGTG CTGTCACCAT GATCCTTTCA TTCGCGATGC TGGTGTTCGG CGGCTCCTAC
GGCGTGCTTG TCTCGGCGCT GGCGCTTGCC GGTCTCGGCT TCGGCTTCCT GCGCTCCGGC
CTGTCCGCCG GCGCGTCACT CTCCGTTTCG CTGAAGGATC AGGGCGCGGT CGCAGGCCTC
ATCGGCTCGA CTGCGGCCAC CGGCCACATC CTCAATCCCG CCATCGGTAT CCCGCTCTTC
TACCTGCTGC ACTCGGCGCC CTTCATGCTT GGCATCGGCC TCATGGTGAT GATCCTCGCC
TTCGCGATCT TTCATCCGGT GCTGAAAAAT CTGCGCAGCG GCGATACGGG CCTGGTCGAC
GAGGAGGAAG AGGAGATCGG GCTGCACTAA
 
Protein sequence
MAEDEDIRTA ETAAPAANRP LRPSDPDRQT TPQERKRAFF ILFVCLLSVG MGQTLVFAVL 
PPIAADLGMS EFETTMIFSL SAALWVLTST FWGRRSDVMG RKPVILIGVF GFAFSTSLVG
FVLLAGYQHW ISMVLMFPLV MGARAIFGIF GSGSMPASQA YVADRTTRAE RAGSIAQIGA
AFGLGTVVGP GIAAAFAEIH ILAPFFAVGA LAFVSGIAIW ILLPERTRPK KMLSEKQEKQ
AALKWSAPNV MPWLVIGVIL SLSQSIMMQL FAFYLMAELG MQGSEATQLV SVGMMAMAMA
TLVAQLGLIQ RFDLSVQFLL RWGAVTMILS FAMLVFGGSY GVLVSALALA GLGFGFLRSG
LSAGASLSVS LKDQGAVAGL IGSTAATGHI LNPAIGIPLF YLLHSAPFML GIGLMVMILA
FAIFHPVLKN LRSGDTGLVD EEEEEIGLH