Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_0533 |
Symbol | |
ID | 7972943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 593171 |
End bp | 594238 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644791136 |
Product | pentapeptide repeat protein |
Protein accession | YP_002942462 |
Protein GI | 239813552 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAGCG CACTGCTGAC TCCGAAACTG CTGGCACTCG CCGTGACGCT GGGCGAGCCG ATGGAAGACA AGGACTTTGG CGCCGGCGCC TATGCGCGCG CGTACCTGGC CGGCGGCGTG TTCTCGCGCT GCCGGTTCGA CGAGGCCGAC CTGCGCGGCG CCGACCTGCG CGAGACGCTG TTCGACCAGT GCAGCTTCAA GGGCGCGCTG ATGGACGGCG CCAGCCTGCG CCGCGCCGTG CTCAACCGGT GCACCTTCGA CCATGCTGCG CTGCGCAGTG CCGACCTGCA CGGCGCGGTG CTGACCGACT GCGCGCTGCC GCATGCGCAA CTGACCCAGG CTGTGCTGTC GATGGCAACC GTCTCGAACT GCGACTTCGC CCATGCCGCG CTGATCGGAG CCGACCTGGA GTCTTCGACC TTCACGCGGT CGAACCTGGT GGACGTCAAC GCGGACGACA GCCGCTGGGT CCACACCTCG ATGCTCGCAT GCGGCTTGGC GCAAATGACC TGGGCGCGCG CCCGGATGCA GCGGGTCGTG TTCCACGAGG TCGACCTGCA GGGGAAATCC TTCGCCGGCC TGTCCCTCGA CGGTTGCCAG TTCGCGAACT GCAACCTCGG CGGCGCGAGC TTTCGCAACG CGCCGATGCG GCAATGCAAT TTCCAGGGCG CGCGCCTCGA TCGTGTCGAC TTCTCCGGCG CGCAAGGGCC GATGGCCGTG TTCTGCGATG CCCGGGGCGA AGCCGTCAAC TTCGCGGGTG CGGGGCTGCG GCAGGCGCTG TTCACGCGCA GCATGCTGCC CGGCGCGCGC TTCGATGGCG CCGACCTGCA CCAGTGCCAC TTTGCGGATG CGAAGCTGGC TGCCGCCTCG CTGCGCGACT GCGATCTGAG CTATGCCGAT TTCAGCCGGG CCGACCTGCA AAAGGTCGAC GGCCGCGGCG CCACCCTGTT GCGCACCGTG CTGCATCGCG CCGACACCGA GGACGCGCTG TGGACCGACC GCCCGCGGGC GCTGGAAACC GACGCGGCGT TGGCGCGCGC GGAACTCTGG AGGGGGCCTG CGCCATGA
|
Protein sequence | MPSALLTPKL LALAVTLGEP MEDKDFGAGA YARAYLAGGV FSRCRFDEAD LRGADLRETL FDQCSFKGAL MDGASLRRAV LNRCTFDHAA LRSADLHGAV LTDCALPHAQ LTQAVLSMAT VSNCDFAHAA LIGADLESST FTRSNLVDVN ADDSRWVHTS MLACGLAQMT WARARMQRVV FHEVDLQGKS FAGLSLDGCQ FANCNLGGAS FRNAPMRQCN FQGARLDRVD FSGAQGPMAV FCDARGEAVN FAGAGLRQAL FTRSMLPGAR FDGADLHQCH FADAKLAAAS LRDCDLSYAD FSRADLQKVD GRGATLLRTV LHRADTEDAL WTDRPRALET DAALARAELW RGPAP
|
| |