Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_2893 |
Symbol | |
ID | 7970782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 3038770 |
End bp | 3040416 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644793478 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_002944779 |
Protein GI | 239815869 |
COG category | [R] General function prediction only |
COG ID | [COG3008] Paraquat-inducible protein B |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.3428 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACG ACGAAGCCAG ACCCCCCAAC GATCCATCGG CCCCGCCCGG GCTTCCTGCG CCGCGCGTGG TGCGCCGGCG CGAATGGCTG CCCTCGCTGA TCTGGCTGAT CCCCATCGTT GCCGCGCTGG TCGGCGTGAT GCTGGTGGTC CGGATCCTGA TGCAGCGCGG GCCCGAGATC GTGCTCACCT TCAACACGGC CGAAGGCCTG GAGGCCAACA AGACGGCCGT CAAGTACAAG GACGTGCAGA TCGGCACGGT GCAGAGCCTG AAGCTCGCGC GCGACCGCTC GCATGTGCGC GTGACCGTGC AGCTCAGCAA GGAAGCCGAG AGCTTCACCG CCGAGGATTC GCGCTTCTGG GTGGTGCGGC CACGGCTCGA CACCTCGGGC ATCTCGGGCC TGGGCACCTT GCTGTCGGGC GCCTACATCG GCGCCGACGC GGGCGTGTCG AACGAGACCG CGAGCGAGTT CAAGGGACTG GAGGTGCCGC CGATCGTCAC GCGCGACGCC TCGGGCCAGC AGTTCCTGCT GCGCGCCACC GACGTGGGCT CGCTCGACGT GGGCTCGCCG GTGTATTTCC GGCGCATCAA GGTGGGCCAG GTGGCCGCCT ACGAACTCGA TGGCGACGGC CGCGGCGTGA CGCTGCGCAT CTTCGTGAAT GCGCCCTACG ACAAGTTCGT GGGCGTCAAC ACGCGCTTCT GGCAGGCCAG CGGCATCGAC GCGCAGCTCA GCGCCAGCGG CTTCACGCTG CGCACGCAGT CGCTCGCCAC CATCCTGCTC GGCGGCATCG CCTTCCAGGC GCCCGACGAC GCCATGGGCC CGCTGGCCAA GGAGAACACG GCCTTCACGC TGGCGCAGGA CGAAACCGCG GCCATGAAGG AGCCCGACGG CCCGCCGCAG ACGCTGCTGA TGTACTTCAA CCAGTCGCTG CGCGGCCTCA CGCCGGGGGC GCCGGTCGAT TTCCGCGGCG TGGTGATCGG CGAGGTCAAG TCGATCGGCG TGGAGTTCGA CCGCGCCGAG CGCGAGTTCC GCATGCCGGT GCTGGTGCAG GTCTATCCGG ACCGGCTGCG CCGCCGCGCG GGCGAGAGCG GCGTGGAGTC GCGCGCCACC CAGCAGGAGC GGCTGCGCTT CCTGGCCGAG AAGGGGCTGC GCGCGCAGCT GCGCAACGGC AACCTGCTGA CCGGGCAGGT GTACGTGGCG CTCGACTTCT TTCCCAAGGC GCCGCCGGCC CGCATCGACG TGACGAAGAA CCCGATCGAG CTGCCCACCA TCGCCAACAG CCTCGACGAG ATCCAGTCGC AGGTGCAGGA GATCGCAAGC AAGCTCAACA AGGTGCCCTA CGAGCAGATT GCGGCCGACC TGCGCACCAC GCTCGCCTCG CTCAACAAGA CGCTGGCCAG CACCGAGCAG GCGGTGAACC GCATCAACAC CGACCTGACG CCCGAACTGG CCGCCGCCAT GAAGGACGTG CGCAAGACCG TCAACAGCGC CGAGCGCACG CTGGCCGACG ACTCTCCGCT GCAGCAGGAC ATGCGCCAGA CGCTGCGCGA ACTCACGCGC GCCGCGGGCT CGGTGCGCGT GCTGACCGAC TACCTCGAGC GGCACCCCGA ATCGCTCCTG CGCGGCAAAC CGGACGACAA GAAATGA
|
Protein sequence | MSDDEARPPN DPSAPPGLPA PRVVRRREWL PSLIWLIPIV AALVGVMLVV RILMQRGPEI VLTFNTAEGL EANKTAVKYK DVQIGTVQSL KLARDRSHVR VTVQLSKEAE SFTAEDSRFW VVRPRLDTSG ISGLGTLLSG AYIGADAGVS NETASEFKGL EVPPIVTRDA SGQQFLLRAT DVGSLDVGSP VYFRRIKVGQ VAAYELDGDG RGVTLRIFVN APYDKFVGVN TRFWQASGID AQLSASGFTL RTQSLATILL GGIAFQAPDD AMGPLAKENT AFTLAQDETA AMKEPDGPPQ TLLMYFNQSL RGLTPGAPVD FRGVVIGEVK SIGVEFDRAE REFRMPVLVQ VYPDRLRRRA GESGVESRAT QQERLRFLAE KGLRAQLRNG NLLTGQVYVA LDFFPKAPPA RIDVTKNPIE LPTIANSLDE IQSQVQEIAS KLNKVPYEQI AADLRTTLAS LNKTLASTEQ AVNRINTDLT PELAAAMKDV RKTVNSAERT LADDSPLQQD MRQTLRELTR AAGSVRVLTD YLERHPESLL RGKPDDKK
|
| |