Gene Information |
Locus tag | Vapar_5468 |
Symbol | |
ID | 7975162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012792 |
Strand | - |
Start bp | 169202 |
End bp | 170809 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644796055 |
Product | protein of unknown function DUF894 DitE |
Protein accession | YP_002947329 |
Protein GI | 239820144 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
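The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The variable names are illustrative and are not fields of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 169202
end_bp = 170809
protein_length_aa = 535

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length_bp = end_bp - start_bp + 1

# A non-spliced CDS is made of whole codons; the final codon is the stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1608
print(remainder)                # 0
print(expected_protein_length)  # 535
```

Under these assumptions the computed values agree with the listed Gene Length (1608 bp) and Protein Length (535 aa).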
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.685954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATC TCGCAGATGC CGCAGAGGCC GCCCAGGCCG CGAAGCCGAG CAGCAGCTTT GCGCCGCTGC GCCAGCCGGT CTTTGCCGTG CTGTGGGCGG CCACCGTGCT CGGCAACATC GGCAGCTTCA TGCGCGACGT GGCGAGCTCC TGGCTCGTGA CCGACCTGTC GGCCAGCCCC ACGGCGGTGG CGCTGATCCA GACCGCGGCC ACGCTGCCGA TCTTCCTGCT CGCGATTCCG GCCGGGGTGC TGTCCGACAT CCTCGACCGG CGGCGCTTCC TGATCTTCGT GCAGCTGGTG CTGGCGGGCG TGAGCGGCAC GCTGCTGGTG CTCTCGCACA CCGGCGCGCT CACCGTCGAG TACCTGATCG CGCTGACCTT CGTCGGCGGC ATCGGCGCCG CGCTCATGGG GCCGACCTGG CAGTCGATCG TGCCCGAGCT GGTGCCGCGC GCCGACCTCA AGAACGCGGT GGCGCTGAAC TCGCTGGGCA TCAATATTGC GCGCTCCATC GGGCCGGCCG CGGGCGGCCT GATCCTCGCG AGCTTCGGCG CCGCCCTGAC CTATGGCGCC GACGTGCTCA GCTATGTGTT CGTGATCGCG GCGCTGCTGT GGTGGAAGCG CCCGGCGGCC GCCGACAGCG GCCTGTCGGA GAACTTCCTG GGCGCCTTCC GCGCCGGCCT GCGCTACACG CGCGCCAGCC GCGAGCTGCA CCGTGTGCTG CTGCGCGCGG CGGTGTTCTT CCTGTTCGCC AGTTCGGTGT GGGCGCTGCT GCCGCTGGTG GCGCGACAGA TGCTGGGCGG CAGCGCCGGC TTCTACGGCA TCCTGCTGGG CGCCGTGGGT GCGGGCGCCA TCGGCGGCGC GCTGGTGATG CCGCGGCTGC GCGCGCGCTT CAATGCCGAC GGCATGCTGC TGCTGGCCTC GCTGCTCACC GCCGGCGTGA TGGGCAGCCT GGTGTTCGCG CCGCCGCAGT GGCTCGCAGT GCTGTTGCTG CTGGTGCTGG GCCTGGGCTG GATCATCGCG CTCACCACGC TCAACGGCGT GGCGCAGTCG ATCCTGCCGA ACTGGGTGCG CGGGCGCGGC CTGGCCGTGT ACCTCACGGT GTTCAACGGC GCAATGGCGG CCGGCAGCCT GGGCTGGGGC CTGGTGGCGC AGGAGATCGG CGTGCCGTAC ACGCTGGTGG CGGGCGCCGC CGGGCTGGTG GTCGTGGCCC TGCTGTTCCA CCGTGCACGC CTGCCCACCG GCGATTCGGA CCTGCAGGCC TCGAACCACT GGCCCGAGCC ACTGGTGGCC GAGCCTGTCG CGCACGACCG CGGTCCCGTG ATGGTGCAGG TCGAGTACCG CATCCGCAAG GAAGACCGCC CGGCGTTCCT GGACGCGATG AAGCGGCTGT CGCTCGAGCG CCGCCGCGAC GGCGCCTACG CATGGGGCGT GACCGAGCAC ACCAGCGACC CCGAGCGCGT GATGGAGTGG TTCCTGGTCG AGTCCTGGGC CGAGCACCTG CGCCAGCACC ACCGCGTGTC GCATGCCGAC GCCGACCTGC AGAACGAAGC CGTGCGCTTT CACATCGGGC CCGGCCGGCC CGAGGTGCAC CACTTCCTGT CGCTCTGA
|
Protein sequence | MADLADAAEA AQAAKPSSSF APLRQPVFAV LWAATVLGNI GSFMRDVASS WLVTDLSASP TAVALIQTAA TLPIFLLAIP AGVLSDILDR RRFLIFVQLV LAGVSGTLLV LSHTGALTVE YLIALTFVGG IGAALMGPTW QSIVPELVPR ADLKNAVALN SLGINIARSI GPAAGGLILA SFGAALTYGA DVLSYVFVIA ALLWWKRPAA ADSGLSENFL GAFRAGLRYT RASRELHRVL LRAAVFFLFA SSVWALLPLV ARQMLGGSAG FYGILLGAVG AGAIGGALVM PRLRARFNAD GMLLLASLLT AGVMGSLVFA PPQWLAVLLL LVLGLGWIIA LTTLNGVAQS ILPNWVRGRG LAVYLTVFNG AMAAGSLGWG LVAQEIGVPY TLVAGAAGLV VVALLFHRAR LPTGDSDLQA SNHWPEPLVA EPVAHDRGPV MVQVEYRIRK EDRPAFLDAM KRLSLERRRD GAYAWGVTEH TSDPERVMEW FLVESWAEHL RQHHRVSHAD ADLQNEAVRF HIGPGRPEVH HFLSL
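The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a string could be cleaned up and sanity-checked before further use is shown here. The helper names (`normalize`, `gc_percent`, `cds_problems`) are hypothetical, and the start-codon check is a simplification that only accepts the common bacterial start codons ATG, GTG and TTG. The example call uses a short toy sequence, not the gene above.

```python
from typing import List

# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Simplification: only the most common bacterial start codons are accepted here.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def normalize(seq: str) -> str:
    """Remove the display whitespace and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def cds_problems(seq: str) -> List[str]:
    """Return a list of basic inconsistencies found in a coding sequence."""
    s = normalize(seq)
    problems = []
    if len(s) % 3 != 0:
        problems.append("length is not a multiple of 3")
    if s[:3] not in COMMON_START_CODONS:
        problems.append("does not begin with a common start codon")
    if s[-3:] not in STOP_CODONS:
        problems.append("does not end with a stop codon")
    return problems


# Toy input in the same block-separated display style.
toy = "ATGGCC TGA"
print(normalize(toy))     # ATGGCCTGA
print(cds_problems(toy))  # []
```

Pasting the Gene sequence field into `gc_percent` gives a value that can be compared with the GC content listed in the record.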