Gene Plav_3085 details


Gene Information

Locus tag: Plav_3085
Symbol:
ID: 5454281
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 3294976
End bp: 3296430
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 63%
IMG OID: 640878674
Product: sodium/proline symporter
Protein accession: YP_001414349
Protein GI: 154253525
COG category: [E] Amino acid transport and metabolism
    [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
    [TIGR02121] sodium/proline symporter
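
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 3294976
end_bp = 3296430

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count, leftover = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, leftover, protein_length_aa)
```

Under these assumptions the script prints `1455 0 484`, which agrees with the listed gene length of 1455 bp and protein length of 484 aa.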


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.678112
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.198592
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGTCA CCGCCGTCAC AATCATCCTC TATCTCCTCG CCATGCTCGC TCTCGGCGTG 
GCCGCCTATC GCTTTACCCG CAACCTCAAG GATTACATCC TCGGCGGCCG CCAGCTCGGC
GGCGCCGTCG CGGCGCTTTC GGCGGGCGCC AGCGACATGA GCGGCTGGCT GATGCTCGGC
CTGCCGGGCG CGATCTACGC CTCCGGCCTC AATCAGATCT GGATCGCGGT CGGCCTTGTC
GCGGGCGCGC TTCTCAATTG GCGCTTTATT GCGCGCCGCC TGCGGCGTTA TACCGAGCGG
GCTGGCGATG CGCTCACGAT ACCGGATTAC TTCGAAAACC GTTTCGCCGA TAGAAGCCGC
GTCCTCCGCG TTGCCAGCGC ATTCGTCATT CTCATCTTCT TCACGATCTA CACCTCTGCC
GGGCTTGTCT CCGGCGCCAT TCTTTTTGAA CAGGTTTTCG GCCTCGACTA CCGGATCGCG
CTTCTCGTCG GCACCGTCAG CATCCTCTCC TACACGGCGA TCGGCGGCTT CCTCGGCGTC
TGCTGGACGG ACACGGTGCA GGGCGTGATG ATGTTCCTCG CCTTGCTCAT CGTGCCCGCC
GTCGCCATCA CGAGCATGGG CGGCTGGGGC GCCACCATGA CGGCAACATC CGCCGTCGCT
CCCGCAGGCG TTTTCGATGC CTTTCACGAT ACGGGTGTAA TCGGCGCCGT CTCCTTGATG
GCGTGGGGCC TTGGCTATTT CGGTCAGCCG CACACCCTCG CGCGTTTCAT GGCGCTTCGC
GACGAGCGCG ATATGCCCGC GGCGCAGATG ATCGGCATGA CATGGATGGT GCTGGCGCTT
TACGGTGCGA TTGCGACGGG GCTTGCCGGC ATCGGCTATT TCGCCGCCGC GCCGCTCGAC
AATCCGGAAA CGGTTTTCCT CTCTCTTTCC CAGTCGCTCT TCAATCCGTG GATAGCAGGT
ATTCTCCTCG CCGCCGTGCT CGCCGCTATC ATGAGCACGG TCTCTTCGCA GCTGCTGGTT
TCGTCGAGCG TCATCGCAGA AGATTTCTGG AAGCGGTTGT TGCGGCCGCA AGCGGAAGCA
GGGGAGTTGC TGAATATCGG CCGCGGCTCC GTCTTTGTCA TTTCGATCGC CGCCTTCCTG
CTGGCACTCG ACCGCGACAG CAGCGTGCTT TCCCTTGTCG CCTATGCCTG GGCAGGCTTT
GGCGCTGCCT TCGGGCCGGT CGTCGTGCTC TCGCTCTTCT GGCGGCGCAT GACGCGTGAC
GGCGCGCTGG CCGGCATGGT CGTCGGCGCG GTAACTGTCA TCGTCTGGAA ACAGTTCTCG
GGCGGCATAT TCGAAATCTA CGAAATTCTT CCCGGTTTCA TTTTCGCGAG CATCGCCATT
GCCGGCGTCA GTCTTCTCGG CCGCGCGCCG GGGCCCGAGG TCGTGGATTT ACATGATTCC
GTGGCCGCAG GCTGA
 
Protein sequence
MEVTAVTIIL YLLAMLALGV AAYRFTRNLK DYILGGRQLG GAVAALSAGA SDMSGWLMLG 
LPGAIYASGL NQIWIAVGLV AGALLNWRFI ARRLRRYTER AGDALTIPDY FENRFADRSR
VLRVASAFVI LIFFTIYTSA GLVSGAILFE QVFGLDYRIA LLVGTVSILS YTAIGGFLGV
CWTDTVQGVM MFLALLIVPA VAITSMGGWG ATMTATSAVA PAGVFDAFHD TGVIGAVSLM
AWGLGYFGQP HTLARFMALR DERDMPAAQM IGMTWMVLAL YGAIATGLAG IGYFAAAPLD
NPETVFLSLS QSLFNPWIAG ILLAAVLAAI MSTVSSQLLV SSSVIAEDFW KRLLRPQAEA
GELLNIGRGS VFVISIAAFL LALDRDSSVL SLVAYAWAGF GAAFGPVVVL SLFWRRMTRD
GALAGMVVGA VTVIVWKQFS GGIFEIYEIL PGFIFASIAI AGVSLLGRAP GPEVVDLHDS
VAAG
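
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The sketch below is an illustration under stated assumptions, not the database's own method. It builds the codon table from the amino acid string that NCBI lists for translation table 11, which assigns the same amino acids as the standard code. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Whitespace is stripped first because the sequences here are printed in blocks of ten.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
# Amino acid assignments for translation table 11 ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Note: alternative start codons (for example GTG) are not converted
    to M here; this gene starts with ATG, so that case does not arise.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGGAAGTCA CCGCCGTCAC AATCATCCTC TATCTCCTCG CCATGCTCGC TCTCGGCGTG"
last_line = "GTGGCCGCAG GCTGA"

print(translate(first_line))
print(translate(last_line))
```

The first call yields `MEVTAVTIILYLLAMLALGV` and the second yields `VAAG`, matching the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would extend the same check to the whole record.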