Gene Plav_1411 details


Gene Information

Locus tag: Plav_1411
Symbol:
ID: 5453715
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 1542843
End bp: 1544492
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 61%
IMG OID: 640876984
Product: general substrate transporter
Protein accession: YP_001412688
Protein GI: 154251864
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
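
The coordinates and lengths in this record are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 1542843
end_bp = 1544492
gene_length_bp = 1650
protein_length_aa = 549

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not an amino acid.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # True
print(gene_length_bp % 3 == 0)                       # True
print(expected_protein_length == protein_length_aa)  # True
```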


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.966407
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGAAG ACAGGACGGC GCGCATGACG CCGGAAGAGC GGCGGGTGAT TTTCGCCTCT 
TCGCTCGGCA CGGTGTTCGA GTGGTACGAC TTTTATTTGT ACGGCTCGCT CGCGGCGATC
ATCTCCGTGC AGTTTTTCTC CGGCGTCAAT CCGACGGCGG GTTTCATCTT CGCGCTGCTT
GCTTTTGCCG CCGGCTTTGC GGTCCGTCCC TTCGGCGCCA TCGTCTTTGG GCGTCTCGGC
GATCTTGTGG GTCGCAAATA CACGTTCCTG GTCACCATCC TCATCATGGG CGTGGCGACC
TTCATCGTCG GTCTGCTGCC AAATTACGAG ACCATCGGCT TTGCCGCGCC GGCTATCCTG
ATCGCCTTGC GGCTTGCGCA GGGCCTCGCA CTTGGCGGCG AATATGGCGG CGCGGCCATC
TATGTCGCGG AACATGCGCC TCATGCGAAG CGCGGCGCCT ATACGTCATG GATACAGACC
ACAGCGACGC TTGGTCTCTT CCTTTCGCTT CTCGTCATTC TCGGTTGCCG TCTTTCGATG
GACAAGGAGA GCTTCGAGAG CTGGGGCTGG CGTATTCCGT TCCTTCTTTC CATCGTCCTG
CTTGGCATTT CCGTCTGGAT AAGGCTCCGT CTCAACGAGT CGCCGCTGTT CCAGCGGATG
AAGGCGGAAG GGACGCTGTC CAAGGCGCCG CTCACGGAAT CCTTCGCGCG CTGGGGTAAT
CTCAAGATCG TCATCATCGC CCTGGTCGGC CTTACAGCCG GACAAGCCGT CGTCTGGTAT
ACGGGGCAGT TCTATGCGCT CTTCTTCCTG ACGCAGGTGC TGAAGGTCGA CAGCCAGACA
GCGAACATCC TTATCGGCGG TGCGCTGCTT GTCGGCGTTC CCTTCTTCGT AATTTTCGGT
GCCTTGTCTG ACAGGATCGG GCGCAAGCCG ATCATTCTGG CGGGCTGCCT GCTTGCGGCG
TTGACCTATT TCCCTCTCTT CTCCGCGCTC ACGCACTACG CCAACCCCGC ACTGGAAGCC
GCGCAGGAGA GGGCACCCGT CGTCGTCGTG GCCGACAGCG CCGCCTGTTC CTTCCAGTTC
AATCCGGTCG GTACCTCGGC CTTCACCACG CCCTGCGATG TCGCAAAGAG CGAACTCGCA
AAACGTGGCA TCCCCTATTC GAATGCGGAG CTCAGAGCAG GGGAGGAAAC GCGCATCGAG
GTAGGCTCTA TCGCCGTTCC ATCATTTGAC GCCAGCGATG ACGCCGGTGG TGAGGCGCGC
GCCGCTTTCG CTGCGGCGCT GACGCTTGCC TTGACGGAAG CCGGCTATCC GCTCGCGGCC
GATCTCGCCG CCATCAACTA TCCGATGGTG GTGCTGATCC TCTTCATCCT CGTTCTCTAT
GTGACGATGG TCTATGGGCC GATCGCGGCG ATGCTGGTCG AGCTTTTCCC GACGCGCATC
CGCTATACGT CGATGTCGCT TCCGTATCAC ATCGGCAATG GCTGGTTCGG CGGCTTCCTG
CCGACGGTCT CTTTCGCCAT CGTGGCGGCG ACGGGAAACC TCTATTCAGG TCTTTGGTAT
CCGGTCGCCA TCGCCGCCAT GACCTTTGTC GTGGGGCTCA TCTTCGTGCC CGAAACGAAA
GACAGGGCGC TGCATCCGGA AGAAGGCTGA
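
The record reports a GC content of 61% for this gene. A minimal sketch of how such a figure could be computed from the sequence as printed, in space-separated blocks of ten bases, is given here. The function name `gc_content` is hypothetical, and the database may round or compute its value differently.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to group the bases.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First block of ten bases from the gene sequence.
print(gc_content("ATGGCGGAAG"))  # 0.6
```

Passing the full gene sequence to the same function gives the overall fraction for the gene, which can be compared with the reported percentage.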
 
Protein sequence
MAEDRTARMT PEERRVIFAS SLGTVFEWYD FYLYGSLAAI ISVQFFSGVN PTAGFIFALL 
AFAAGFAVRP FGAIVFGRLG DLVGRKYTFL VTILIMGVAT FIVGLLPNYE TIGFAAPAIL
IALRLAQGLA LGGEYGGAAI YVAEHAPHAK RGAYTSWIQT TATLGLFLSL LVILGCRLSM
DKESFESWGW RIPFLLSIVL LGISVWIRLR LNESPLFQRM KAEGTLSKAP LTESFARWGN
LKIVIIALVG LTAGQAVVWY TGQFYALFFL TQVLKVDSQT ANILIGGALL VGVPFFVIFG
ALSDRIGRKP IILAGCLLAA LTYFPLFSAL THYANPALEA AQERAPVVVV ADSAACSFQF
NPVGTSAFTT PCDVAKSELA KRGIPYSNAE LRAGEETRIE VGSIAVPSFD ASDDAGGEAR
AAFAAALTLA LTEAGYPLAA DLAAINYPMV VLILFILVLY VTMVYGPIAA MLVELFPTRI
RYTSMSLPYH IGNGWFGGFL PTVSFAIVAA TGNLYSGLWY PVAIAAMTFV VGLIFVPETK
DRALHPEEG
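
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own pipeline: it builds the codon table from the standard NCBI amino acid string, which translation table 11 shares with the standard code for amino acid assignments (the two differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical. Alternative start codons are not handled, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the bases.
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence.
print(translate("ATGGCGGAAG ACAGGACGGC GCGCATGACG"))  # MAEDRTARMT
print(translate("GACAGGGCGC TGCATCCGGA AGAAGGCTGA"))  # DRALHPEEG
```

The two outputs match the first and last blocks of the protein sequence, and the final TGA codon is the stop codon that accounts for the difference between 550 codons and 549 amino acids.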