Gene Information |
Locus tag | Vapar_3922 |
Symbol | |
ID | 7970351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 4161907 |
End bp | 4163112 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644794508 |
Product | protein of unknown function DUF989 |
Protein accession | YP_002945802 |
Protein GI | 239816892 |
COG category | [S] Function unknown |
COG ID | [COG3748] Predicted membrane protein |
TIGRFAM ID | |
|
|
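The coordinate, gene-length, and protein-length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4161907
end_bp = 4163112
gene_length_bp = 1206
protein_length_aa = 401

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the last codon is assumed to be
# the stop codon, which contributes no amino acid to the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # coordinates match gene length
print(remainder == 0)                             # length is a multiple of three
print(expected_protein_aa == protein_length_aa)   # codons minus stop match protein length
```

For this record all three checks hold: 4163112 - 4161907 + 1 = 1206 bp, and 1206 / 3 = 402 codons, of which one is the stop codon, leaving 401 aa.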
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.222073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAGCT ATTACCTCGA CTGGGCCAAC CTGCTGCTGC GCTGGGTCCA CGTCATCACC GCCATCGCCT GGATCGGCGC CTCCTTCTAC TTCGTGATGC TGGACAACAG CCTCGAGAAG CCGCAAGACC CCGAGTCGCT CGACAAGGGC GTGGGCGGCG AGCAATGGGC CGTGCACGGC GGCGGCTTCT ACAACATGCA GAAGTACGCG CTGGCGCCCA AGCGGCTGCC GGACCACCTG CACTGGTCGT ACTGGGAGAG CTACAGCACC TGGCTCACGG GCTTTGCGCT CTTCACCATG TCGTACCTGT GGAACGCCAG CACCTACCTG ATCGACAAGT CGAAGATGGA CTGGCAGCCG GGCGCCGCCA TTGGCGTGGC GCTGGCTTTC TTCGTGGTGT TCTGGATGGT CTACGACGGC ATCTGCCAGA TCTTCGGCCG CCGCAAGAAC GGCGACACCA TCGTGGGCGT GCTGATCGCG CTCTTCATCG TGTTTGCGAC CTGGCTGGCC TGCCAGTGGT TCGCGGGCCG TGCGGCCTTC TTGCTGGTGG GCGCGATGAT GGCCACCACG ATGAGCGGCA ACGTGTTCTT CTGGATCATT CCGGGCCAGC GCAAGAACGT GCAGGCCCTG CGCGAAGGCC GGCCGGTCGA TCCGGTGCAC GGCCAGCGCG GCAAGCAGCG CAGCGTGCAC AACACCTACT TCACGCTGCC GGTGCTGTTC GCGATGCTGA GCAACCACTA CAGCTTCACC TACACGCACA AGTACAACTG GATCGTGCTG CTGCTGATCA TGCTCGGCGG CGCGGCCATT CGCCAGTTCT TCGTGGTGCG GCACCGCTTC AAGCTCGGCA ACGCGGGCAA CCCGCTGCCC TATGCGCTGA TCGGCATCGT GGTGCTGGGG CTCACCATCG TCTGGATGAA GCCCGAGCCG GCGGCCGCGC CTGTGGCTGC GGCAGCAGCG CCCGCGGTGG CATTCAAGGA CGTGCAGAAG GTGCTCGAGC AGCGCTGCTT CATGTGCCAT GGCGAGGCCG TGCAGATGAA GAACGTGCGC GTCGATTCGC CCGAGCAGGT GGCGGCGCAT GCGCAGGCCA TCTACCAGCA GGTGGTGGTC ACGAAGATCA TGCCGATGAA CAACGCCACC GGCATCACCG ACGAAGAACG CGCGCTGATC GGCCGCTGGT TCCAGGCCGG CGCCAAGACC AACTAG
|
Protein sequence | MESYYLDWAN LLLRWVHVIT AIAWIGASFY FVMLDNSLEK PQDPESLDKG VGGEQWAVHG GGFYNMQKYA LAPKRLPDHL HWSYWESYST WLTGFALFTM SYLWNASTYL IDKSKMDWQP GAAIGVALAF FVVFWMVYDG ICQIFGRRKN GDTIVGVLIA LFIVFATWLA CQWFAGRAAF LLVGAMMATT MSGNVFFWII PGQRKNVQAL REGRPVDPVH GQRGKQRSVH NTYFTLPVLF AMLSNHYSFT YTHKYNWIVL LLIMLGGAAI RQFFVVRHRF KLGNAGNPLP YALIGIVVLG LTIVWMKPEP AAAPVAAAAA PAVAFKDVQK VLEQRCFMCH GEAVQMKNVR VDSPEQVAAH AQAIYQQVVV TKIMPMNNAT GITDEERALI GRWFQAGAKT N
|
| |
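The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a hand-rolled illustration, not the database's own method. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the tables differ only in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first 30 nt of the gene are embedded here; pasting the full sequence works the same way, because the 10-character spacing is stripped before use. Although the record lists the strand as "-", the sequence as displayed begins with ATG, so the sketch assumes it is already in coding orientation and applies no reverse complement.

```python
# Standard codon assignments, ordered with T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-nt blocks of the gene and first 10-aa block of the protein,
# copied from the Sequence fields above.
gene_prefix = "ATGGAAAGCT ATTACCTCGA CTGGGCCAAC"
protein_prefix = "MESYYLDWAN"

print(translate(gene_prefix) == clean(protein_prefix))
```

Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field listed in the record.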