Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_0204 |
Symbol | |
ID | 7971413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 216004 |
End bp | 217080 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644790807 |
Product | pentapeptide repeat protein |
Protein accession | YP_002942133 |
Protein GI | 239813223 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.550889 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCCGG CCAGTGTGCA GGCGATGATC CATGGCGGAG AAACGATCGC TCGGAAGCAA CTTCAAGCGC TCGACTTTCG GGGCATGGAC CTCACGGGTG CCATTTTCCT GGAGACCGAT CTGAGTGATG CCGATCTGTC CGGTGCCAAG CTCGATGCGA GCATCTTCAA GCAGTGCAAG CTTGCCGGGG CCCGGCTTGA TGCCGCGCAT GTGGACCGTT GTGCATTCGA GCATTGCGAT GCAACCGGGA TCCGGATGGT CGGCGCCACG CTCGACGGTG CCGCCTTCTA CCAATGTGGA CTGGCCGGCG CAAAGCTCGA TGACGCAAAT GCCGGCGTTG CCTCTTTCGC CAACTGCACG CTCGACCGCG CCTCGTTCGT CAATGTGCTG TCGGAAGCCC TGGTGTTCTC CGAGACCTCG CTCGATGCCA CCGACTTTTC CACGACGCTC CTGCACAAGA CAGTGTTCTA CCGGGCAGAT CTTCGATCGG CAATCTTCCG TGGCAGCCGG TTCGACAAGG CTGTCTTCTC CGAGGCCAAG CTCTCGGGAC AGAGATTCAA TGCGCAGAAG TTCCTGCTTT GCCAGTTCAT CGACGCAGAA CTCGACGGAT GCGATTTTGA CGAGTCCATG CTTGTGCAAT GCAACTTCAA GGGCGCCAAG CTCGAAGGTG CGAGCCTGAA CCGGGTCCAT GCGCCCCAAT GCCTCTTTCC GTCGGCCAAG CTCGATGGCG TCGGTTGCCG CGGCGCCCAT TTCAATCAAA GCATCTGGGT CGAGGTGCAG GCGCGCGGAG CCGATTTCTC CGACGCGAAA CTGGAGCAGT GCATCTTCCA GCGTGCCAGC TGCAGCCATG CCGTCTTCGC AAGGGCGGAC CTTACATACG CCGATTTTTC TCACGCCGAT CTGCGTGGTG CCGATCTGCG GGGCGCTCGC TTCCTGCGCA CCAAGGTTCA CCGCGCGCAA CAGGGGGGAA CGAGATATGG CGACCGTGGT GGCTTGCTGT TGAACGACCC CGACCTGTTT GCCGCCGAGG AGTTCTCGGC ACGCCGCCTC GCGCGTCCGG GCGCATATCC CTCTTGA
|
Protein sequence | MTPASVQAMI HGGETIARKQ LQALDFRGMD LTGAIFLETD LSDADLSGAK LDASIFKQCK LAGARLDAAH VDRCAFEHCD ATGIRMVGAT LDGAAFYQCG LAGAKLDDAN AGVASFANCT LDRASFVNVL SEALVFSETS LDATDFSTTL LHKTVFYRAD LRSAIFRGSR FDKAVFSEAK LSGQRFNAQK FLLCQFIDAE LDGCDFDESM LVQCNFKGAK LEGASLNRVH APQCLFPSAK LDGVGCRGAH FNQSIWVEVQ ARGADFSDAK LEQCIFQRAS CSHAVFARAD LTYADFSHAD LRGADLRGAR FLRTKVHRAQ QGGTRYGDRG GLLLNDPDLF AAEEFSARRL ARPGAYPS
|
| |