Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_1066 |
Symbol | |
ID | 5456352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | - |
Start bp | 1171157 |
End bp | 1173058 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640876636 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001412345 |
Protein GI | 154251521 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components [COG4176] ABC-type proline/glycine betaine transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAACC GGGCCTCCCT TGCTCTTGCG GCGCTTTTTC TTTCGGTTTG GGGCTTTCCT TCGGGTGCCG CAGCCGCGCC GTCATGCGCG ATCGACCGGC CGGTGATGTT TGGCGGGCTC GACTGGGATT CCAACGCCTT CCACACCGAA CTCGCGCGCC TCATCATCGA GCGCGGCTAT GGCTGCGAAA CGGATGTGCT GCCGGGCTCG ACGCTGCCGC TGGTGACGGG GCTCGGACAA GGCGACATCG ACGTGCTGAT GGAAATCTGG CGCGACAATG TGACTGAGCC GTGGAACGAG GCGCTGGAAG CCGGGACCGT GGTTTCGCTC GGCACGAATT TTCCGGATGC GGTGCAGGGC TGGTACGTGC CGCGCTATGT GATCGAAGGG GATGCGGCGC GCGGCATCGA GCCGATGGCG CCGGATCTCC GGTCCGTCTC CGACCTCAAG CAGCATGCCG CTCTCTTTCG TGACCCGGAG CAGCCGGGCA AGGGGCGCTT CTACAATTGC ATTCTCGGCT GGAGCTGCGA GGAGCAGAGC ACACGCAAGC TTGAGGGGTA TGGGCTCGAC CGCTTCTACA CGAACTTCAG GCCCGGAACC GGGGCGGCGC TTGCGGCTGC CATCGCATCC GCCTATGAGC GCGGCGAGCC GGTCGTCGCC TATTACTGGG GACCGACATG GGTTCTCGGC ACGTACGATC TCGTGAAGCT CGAAGAGCCC CCATATTCGG AAGAGGCCTG GAACGCCTTC AACGCCGACC CGAAGCGGAA CCCGCCCGTC GCCTTTCCAA CCGTCGAGGT CATCATCGGC GCCAATGCCC GTTTCGCGGA GAGGGCTCCC GAGATCACGG CCTTCCTGCG CGCATATGAG ACGACGGGGC AGATGACGAG CGAGGCGCTC GCCTATATGC AGGCGAACAG CGAGGCGTCG GCGCGGGACG CGGCGTTGCA CTTCATCGAG ACACGGCCGG ACATCTGGCG GCAATGGGTG ACGCCCGAGA TTGCGGCGCG GGTGACCGGC GAACGGGAAG CGGAAGCGAG GAGCTTTCCC GTTGCTCTCG AATTTCCCGT CGAGGCCTGG GTCAATCGCG CGATGAAGAA CTTCGTTGCC GATTACGGCA ACGCTTTCCA TGCGGCGAGC GGCTGGCTGC TCGCGCTTAT CGTGGCGCTG GAGGCCGGGC TCGGTGCCCT GCCCTGGTGG CTCATCATTC TCGCGGTGGC CGGCGCAGCC TGGCATGCCT CGCGCCATTT TGCGTTGCCC GTCATACTGG CGGGATTGCT GTTTCTGATC GGCACTCTCG GTCTCTGGGA CCTCGCGATC CAGACTTTGG CCCTGATGCT CGTGTCGGTC TTCTTCGCTG TGCTGATCGG GCTGCCTTCG GGGATTGCGA TCGCCGCCAG CGATATTGCT CGACGGATCG TTCTGCCGGT GCTCGACGCG ATGCAGACGC TACCAAGCTT CGTCTATCTC ATTCCGGCGC TGATGCTGTT CGGGCTCGGC AAGGTGCCTG CGCTCTTTGC AACCGTCATC TATGCAACGC CGCCGCTAAT CCGCCTGGTC GATCTGGGTT TGCGGACCGT GGACCGGCAA CTGGTGGAAA TGGCGGCTGA TCTGGGCGCA GACAAGTGGC GGCAGCTCAT CGACATCAAA TTGCCGCTGG CGCTGCCGAG CATCATGGCG GGCGTCAACC AGACGACGAT GATGGCGCTT TCCATGGTCG TCATCGCGTC GATGATCGGG GCGCGCGGGC TCGGCGAGGA AGTGCTGCTC GGCATCCAGC GGCTCGATAT CGGGCGCGGG CTGGCGGCGG GCATCGCGAT CGTCGCTCTC GCCATCGTGT TCGACCGGAT CACGCAGGCT TATGGGCGCG TGAACCGCAA AGACGCGCCG CCGGGGCGCT AG
|
Protein sequence | MLNRASLALA ALFLSVWGFP SGAAAAPSCA IDRPVMFGGL DWDSNAFHTE LARLIIERGY GCETDVLPGS TLPLVTGLGQ GDIDVLMEIW RDNVTEPWNE ALEAGTVVSL GTNFPDAVQG WYVPRYVIEG DAARGIEPMA PDLRSVSDLK QHAALFRDPE QPGKGRFYNC ILGWSCEEQS TRKLEGYGLD RFYTNFRPGT GAALAAAIAS AYERGEPVVA YYWGPTWVLG TYDLVKLEEP PYSEEAWNAF NADPKRNPPV AFPTVEVIIG ANARFAERAP EITAFLRAYE TTGQMTSEAL AYMQANSEAS ARDAALHFIE TRPDIWRQWV TPEIAARVTG EREAEARSFP VALEFPVEAW VNRAMKNFVA DYGNAFHAAS GWLLALIVAL EAGLGALPWW LIILAVAGAA WHASRHFALP VILAGLLFLI GTLGLWDLAI QTLALMLVSV FFAVLIGLPS GIAIAASDIA RRIVLPVLDA MQTLPSFVYL IPALMLFGLG KVPALFATVI YATPPLIRLV DLGLRTVDRQ LVEMAADLGA DKWRQLIDIK LPLALPSIMA GVNQTTMMAL SMVVIASMIG ARGLGEEVLL GIQRLDIGRG LAAGIAIVAL AIVFDRITQA YGRVNRKDAP PGR
|
| |