Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_5203 |
Symbol | |
ID | 7969862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 5526705 |
End bp | 5527703 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644795797 |
Product | hypothetical protein |
Protein accession | YP_002947071 |
Protein GI | 239818161 |
COG category | [S] Function unknown |
COG ID | [COG3181] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.65094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGCGG GGCGCAGGCA TTTCGCCGCC GGGCTGGCGG CTGGTGCGGC GGCCCTGCTG CTTGCCGGCA CGGCCGGCTT GGCGCAGGCC GCGGCCTGGC CCGACAAGCC CGTGAAGCTG GTGGTGCCGT TCCCGCCCGG GCAGGCCACC GACATCTTCG CGCGCGCACT CGCCGAGCAG CTCGGCAAGC GGCTGGGCCA GCCGGTGATC GTCGACAACA AGGCCGGTGC CGGCAGCAAC ATCGGCACCG AGTTCGTGGT GCGCGCGCCG GCCGACGGCT ACACGCTGGT GGTGGCCGGC AGCGCGATGG CGGTGAACCA GACGCTCTAC GCCAAGCCGG GCTTCGATCC GCGCAAGGAC CTGGTGGGCA TCTCGCTCAT CGCCACGGTG CCGCTGGTGT TCCTGGCCAC GCCCGAGAGC GGCATCCGCA GCATGGCCGA CCTGGCGGCG CACGCCAAGG CCGAACCGGG CCGGCTGAGC TATGCGAGCG CCGGCATCGG CGGCACGCAG CACCTGTCGG GCGAGATGTT CAAGTCGGCC GCGCGCGTCT TCATCACCCA CATTCCGTAC CGCGGCAGCG GGCCGGCGCA GTCGGACTTC CTGGGCAACC AGATTCCGCT GATGGTCGAC TCGGTGACGG CCGCGCTGCC GCACATCAAG TCGGGCAGGG CGGTGGCGCT GGCGGTGACC TCGGCCAAGC GCTCCTCGCA GCTGCCTGAC GTGCCCACCG TGCGCGAAAG CGGCGTGGCC GGCACCAGGG ACTTCGAGGC CGTGGGCTGG CTCGGCCTGA TGGCGCCGCG CGGCACGCCG CCCGAGATCA CGGCGCGGCT GAACCAGGAA GTGACCGACA TCCTCAAGAG CGAGCAGATG GCGCGCTTCA TCCGCGACCG CGGCTCCGAG CCCGCGCCCA CCACGGGCGC CGAGTTCGAC CGCTTCGTGG CGAACGAGAT CCAGCGCTGG GGCGGCGCGG TGAAGGCCTC GGGCGCCAAG CCCGAGTAG
|
Protein sequence | MRAGRRHFAA GLAAGAAALL LAGTAGLAQA AAWPDKPVKL VVPFPPGQAT DIFARALAEQ LGKRLGQPVI VDNKAGAGSN IGTEFVVRAP ADGYTLVVAG SAMAVNQTLY AKPGFDPRKD LVGISLIATV PLVFLATPES GIRSMADLAA HAKAEPGRLS YASAGIGGTQ HLSGEMFKSA ARVFITHIPY RGSGPAQSDF LGNQIPLMVD SVTAALPHIK SGRAVALAVT SAKRSSQLPD VPTVRESGVA GTRDFEAVGW LGLMAPRGTP PEITARLNQE VTDILKSEQM ARFIRDRGSE PAPTTGAEFD RFVANEIQRW GGAVKASGAK PE
|
| |