Gene Information |
Locus tag | Vapar_3117 |
Symbol | |
ID | 7974327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 3275422 |
End bp | 3276351 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644793702 |
Product | protein TolA |
Protein accession | YP_002945002 |
Protein GI | 239816092 |
COG category | [A] RNA processing and modification |
COG ID | [COG5178] U5 snRNP spliceosome subunit |
TIGRFAM ID | [TIGR01352] TonB family C-terminal domain; [TIGR02794] TolA protein |
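The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3275422
END_BP = 3276351


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 930 309
```

Under these assumptions the result agrees with the listed gene length (930 bp) and protein length (309 aa).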
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.321745 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCTCG CACTGGATCG CCCCGAGTTC GCACCACCGC CGCAGCGCGG CACGCCGCGC GCGGTGCTGC TCGCGCTGGT TGCGCACGCG CTGCTGATTG CCGCGCTGAC GTGGGGAGTG CGCTGGCGCA GCGATGCCGA CGAAGGCGCC GTCGATGCGG AGCTGTGGTC TTCCACGGTG CAGCAGGCTG CGCCGCGCCT GTCGGCGCCG CAGGCACCCA CGCCCGCTCC CGCGCCTCCG CCGCCTGCGC CTCCTCCGCC GCCACCGCCG CCTCAGGTGA AGGCCCCCGA GCCCGCACCG CCGCCGCGCG CGCCGGACAT CGCCCTCGAG CGCGAGAAAA AGCTCAAGGA AGAAAAGGAA CAGAAGGAGC GCGAGCTCGA GCGCCAGCAA CAGCAGCGCA AGAAGGAGCT CGAAGCCAAG CAGCGCGCCG AAGACGAAGC CGAACGCAAG AAGGCGCAGC AGCAAAAGCT CGCCGAACAG CAGAAGAAAC AGCAGCAGGA GGCCGAGGCC AAGCAGGCCG AAGCGAAGAA GCAGCAGGAA GCCGCAGCCA AGCAGGCCGC GGCCGACCGC GCCGCAACGC TCAAGCGCAT GCAGGGCCTC GCGGGCGCGA GCGGCAGCGA CGATTCCAAG GGCACGGCAA TGCGCTCGTC AGGACCGTCG AGCGGCTACG CCGGACGCAT TGCCGCGGCG GTGCGCCCCA ACATCACCTT CCCCGATGCC GAGACGGTCA ACGGCAATCC GGCGGCCGAG TTCGAGGTGA ACCTGGCACC GGACGGCACC ATCGTCGGCG TCAAGCTGAC CAAGTCGAGC GGACTGCCCA GCTGGGACGA AGCCGCCGAA CGCGGCCTGC ACAAGACCGA CAAGCTGCCG CGCGACACCG ACGGGCGCAT CTTCCCGTCG CTGATCGTCT CGCTGCGGCC CAAGCGGTAG
|
Protein sequence | MSLALDRPEF APPPQRGTPR AVLLALVAHA LLIAALTWGV RWRSDADEGA VDAELWSSTV QQAAPRLSAP QAPTPAPAPP PPAPPPPPPP PQVKAPEPAP PPRAPDIALE REKKLKEEKE QKERELERQQ QQRKKELEAK QRAEDEAERK KAQQQKLAEQ QKKQQQEAEA KQAEAKKQQE AAAKQAAADR AATLKRMQGL AGASGSDDSK GTAMRSSGPS SGYAGRIAAA VRPNITFPDA ETVNGNPAAE FEVNLAPDGT IVGVKLTKSS GLPSWDEAAE RGLHKTDKLP RDTDGRIFPS LIVSLRPKR
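The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below is illustrative rather than the database's own method: it strips the whitespace, computes GC content, and translates a coding sequence. The codon table uses the amino acid assignments of the standard code, which translation table 11 shares; alternative start codons permitted by table 11 are not handled, which is acceptable here only because this gene starts with ATG. All function names are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGTCGCTCG CACTGGATCG CCCCGAGTTC"
print(translate(first_blocks))  # MSLALDRPEF
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and protein sequence fields.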