Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_2519 |
Symbol | proX |
ID | 7969996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 2659361 |
End bp | 2660371 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644793106 |
Product | glycine betaine transporter periplasmic subunit |
Protein accession | YP_002944411 |
Protein GI | 239815501 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.395168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA GAACCCGTCT CATCCTTGCC CGCGGCCTGC TGGCCCTCGG TTTCGCCGCC TTCGGCATGG GCGCACAGGC CGCCAATGAC CTGCCCGGCC AGGGCGTCAC CGTGCAGCCG CTCAAGAGCT CGCTCGCCGA AGAGGCGTTC CAGACGCTGC TCGTGATGCG CGCGCTCGAG AAGCTCGGCT ACACGGTGGA GCCCATGAAG GACCTCGAAC CCGCCACCGA GCACCTGGCC ATCGCCAATG GCGACGCCAC CTTCATGGCC AACCACTGGA GCCTGCTGCA CGCCGACTTC TACAAGAACA GCGGCGGCGA CGCCAAGCTG TGGCGCAAGG GCGTGTACTC GGACGGCGCG GTGCAGGGCT ACCTGATCGA CCGCAAGACG GCCGAGCAGT ACAACATCCG CAGCATCGCG CAGCTGAAAG ACCCGGCCAT CGCCAGGCTG TTCGATGCCG ACGGCGACGG CAAGGCCGAC CTGACGGGCT GCAACCCCGG CTGGGGCTGC GAACTGGCGA TCGAGAACCA CCTGACGGCC TACCAGCTGC GCGACACCGT CACGCACAAG CAGGGCAGCT ATGCCGCGCT GATGGCCGAC ACCATCGCCC GCTTCAAGCA GGACAAGCCG GTGCTGTACT ACACCTGGAC GCCCTACTGG GTCAGCGCGG TGCTGCGGCC CGGTGCGGAC GTGGTGTGGC TGCAGGTGCC CTTCTCGTCC TCCCAGGGCG GCAACGCGGA CACGCAGCTT CCCAACGGCA AGAACTACGG CTTCCAGGCC AACCAGGAGC AGATCGTCGC CAACCGGGCC TTCGTCGAGA AAAACCCGGC CGCCGGCCGG CTGTTCGAGG TGATGAAACT GCCGATCGGC GACATCAACG CCCAGAACCT GCGCATGAGC CAGGGCGCCA ACACGCAGCA AGACCTGGAG CGCCACACTG ACGGCTGGAT CCGGGCGCAC CGGCCGCTGT TCGATGGCTG GATCGAGCAA GCCCGGGCCG CCGCAAGGTA G
|
Protein sequence | MKKRTRLILA RGLLALGFAA FGMGAQAAND LPGQGVTVQP LKSSLAEEAF QTLLVMRALE KLGYTVEPMK DLEPATEHLA IANGDATFMA NHWSLLHADF YKNSGGDAKL WRKGVYSDGA VQGYLIDRKT AEQYNIRSIA QLKDPAIARL FDADGDGKAD LTGCNPGWGC ELAIENHLTA YQLRDTVTHK QGSYAALMAD TIARFKQDKP VLYYTWTPYW VSAVLRPGAD VVWLQVPFSS SQGGNADTQL PNGKNYGFQA NQEQIVANRA FVEKNPAAGR LFEVMKLPIG DINAQNLRMS QGANTQQDLE RHTDGWIRAH RPLFDGWIEQ ARAAAR
|
| |