Gene Plav_1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_1066 
Symbol 
ID5456352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp1171157 
End bp1173058 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content65% 
IMG OID640876636 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001412345 
Protein GI154251521 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components
[COG4176] ABC-type proline/glycine betaine transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAACC GGGCCTCCCT TGCTCTTGCG GCGCTTTTTC TTTCGGTTTG GGGCTTTCCT 
TCGGGTGCCG CAGCCGCGCC GTCATGCGCG ATCGACCGGC CGGTGATGTT TGGCGGGCTC
GACTGGGATT CCAACGCCTT CCACACCGAA CTCGCGCGCC TCATCATCGA GCGCGGCTAT
GGCTGCGAAA CGGATGTGCT GCCGGGCTCG ACGCTGCCGC TGGTGACGGG GCTCGGACAA
GGCGACATCG ACGTGCTGAT GGAAATCTGG CGCGACAATG TGACTGAGCC GTGGAACGAG
GCGCTGGAAG CCGGGACCGT GGTTTCGCTC GGCACGAATT TTCCGGATGC GGTGCAGGGC
TGGTACGTGC CGCGCTATGT GATCGAAGGG GATGCGGCGC GCGGCATCGA GCCGATGGCG
CCGGATCTCC GGTCCGTCTC CGACCTCAAG CAGCATGCCG CTCTCTTTCG TGACCCGGAG
CAGCCGGGCA AGGGGCGCTT CTACAATTGC ATTCTCGGCT GGAGCTGCGA GGAGCAGAGC
ACACGCAAGC TTGAGGGGTA TGGGCTCGAC CGCTTCTACA CGAACTTCAG GCCCGGAACC
GGGGCGGCGC TTGCGGCTGC CATCGCATCC GCCTATGAGC GCGGCGAGCC GGTCGTCGCC
TATTACTGGG GACCGACATG GGTTCTCGGC ACGTACGATC TCGTGAAGCT CGAAGAGCCC
CCATATTCGG AAGAGGCCTG GAACGCCTTC AACGCCGACC CGAAGCGGAA CCCGCCCGTC
GCCTTTCCAA CCGTCGAGGT CATCATCGGC GCCAATGCCC GTTTCGCGGA GAGGGCTCCC
GAGATCACGG CCTTCCTGCG CGCATATGAG ACGACGGGGC AGATGACGAG CGAGGCGCTC
GCCTATATGC AGGCGAACAG CGAGGCGTCG GCGCGGGACG CGGCGTTGCA CTTCATCGAG
ACACGGCCGG ACATCTGGCG GCAATGGGTG ACGCCCGAGA TTGCGGCGCG GGTGACCGGC
GAACGGGAAG CGGAAGCGAG GAGCTTTCCC GTTGCTCTCG AATTTCCCGT CGAGGCCTGG
GTCAATCGCG CGATGAAGAA CTTCGTTGCC GATTACGGCA ACGCTTTCCA TGCGGCGAGC
GGCTGGCTGC TCGCGCTTAT CGTGGCGCTG GAGGCCGGGC TCGGTGCCCT GCCCTGGTGG
CTCATCATTC TCGCGGTGGC CGGCGCAGCC TGGCATGCCT CGCGCCATTT TGCGTTGCCC
GTCATACTGG CGGGATTGCT GTTTCTGATC GGCACTCTCG GTCTCTGGGA CCTCGCGATC
CAGACTTTGG CCCTGATGCT CGTGTCGGTC TTCTTCGCTG TGCTGATCGG GCTGCCTTCG
GGGATTGCGA TCGCCGCCAG CGATATTGCT CGACGGATCG TTCTGCCGGT GCTCGACGCG
ATGCAGACGC TACCAAGCTT CGTCTATCTC ATTCCGGCGC TGATGCTGTT CGGGCTCGGC
AAGGTGCCTG CGCTCTTTGC AACCGTCATC TATGCAACGC CGCCGCTAAT CCGCCTGGTC
GATCTGGGTT TGCGGACCGT GGACCGGCAA CTGGTGGAAA TGGCGGCTGA TCTGGGCGCA
GACAAGTGGC GGCAGCTCAT CGACATCAAA TTGCCGCTGG CGCTGCCGAG CATCATGGCG
GGCGTCAACC AGACGACGAT GATGGCGCTT TCCATGGTCG TCATCGCGTC GATGATCGGG
GCGCGCGGGC TCGGCGAGGA AGTGCTGCTC GGCATCCAGC GGCTCGATAT CGGGCGCGGG
CTGGCGGCGG GCATCGCGAT CGTCGCTCTC GCCATCGTGT TCGACCGGAT CACGCAGGCT
TATGGGCGCG TGAACCGCAA AGACGCGCCG CCGGGGCGCT AG
 
Protein sequence
MLNRASLALA ALFLSVWGFP SGAAAAPSCA IDRPVMFGGL DWDSNAFHTE LARLIIERGY 
GCETDVLPGS TLPLVTGLGQ GDIDVLMEIW RDNVTEPWNE ALEAGTVVSL GTNFPDAVQG
WYVPRYVIEG DAARGIEPMA PDLRSVSDLK QHAALFRDPE QPGKGRFYNC ILGWSCEEQS
TRKLEGYGLD RFYTNFRPGT GAALAAAIAS AYERGEPVVA YYWGPTWVLG TYDLVKLEEP
PYSEEAWNAF NADPKRNPPV AFPTVEVIIG ANARFAERAP EITAFLRAYE TTGQMTSEAL
AYMQANSEAS ARDAALHFIE TRPDIWRQWV TPEIAARVTG EREAEARSFP VALEFPVEAW
VNRAMKNFVA DYGNAFHAAS GWLLALIVAL EAGLGALPWW LIILAVAGAA WHASRHFALP
VILAGLLFLI GTLGLWDLAI QTLALMLVSV FFAVLIGLPS GIAIAASDIA RRIVLPVLDA
MQTLPSFVYL IPALMLFGLG KVPALFATVI YATPPLIRLV DLGLRTVDRQ LVEMAADLGA
DKWRQLIDIK LPLALPSIMA GVNQTTMMAL SMVVIASMIG ARGLGEEVLL GIQRLDIGRG
LAAGIAIVAL AIVFDRITQA YGRVNRKDAP PGR