Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_1940 |
Symbol | |
ID | 7971119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 2074756 |
End bp | 2077023 |
Gene Length | 2268 bp |
Protein Length | 755 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644792539 |
Product | exopolysaccharide transport protein family |
Protein accession | YP_002943853 |
Protein GI | 239814943 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01005] exopolysaccharide transport protein family [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACTC CTGCTGTGAA CACTTCTAAT GCAGCGCGCA TCGGTCAGCC AGTCGCGCCT TCTACTCCAG GAGCGGGCGA TGACAACATT CACTTGTCGG AACTAATCGA TATCGTGCTC GACCATAAGT GGTTGGTCAC AGCCATCACC GCACTTTCGC TGGTGCTCGG TTTGGTCTAC ATACTGTTGT CTACTCCCGT TTACCAATCC AACCTATTAG TCCAGGTAGA GGATGCGGCG GCTGATGCCA AGGGTTTTCT CGGTGAAACC TCCACCCTTT TTGACGTAAA GACGCCCGCC ACGGGTGAGA TCCAGGTTAT TCGTTCGCGG ATGGTAATCG GCGCAGCAGT GGACCAGACG CGCGCCTATA TCGAAGCCAA ACCGAATTAC CTGCCGATAC TCGGCCCATG GCTGGCTCGG CGGGCCAGCG GACTTTCGGA GCCCGGCTTC CTAGGTATGA GCGGCTACGT CTCCGGCAAA GAGAAAATCG ATGTCGCACG TTTTGACGTG CCGCCTGCGC TCGAAGACGA AGAGCCCTTC GTGCTTACCG CTCAGGGTCA AGGCAAGTAC ACGCTCAGTC ACGACGCGCT GGACCAGCCG CTGGCCGGCA CCGTTGGCCT GCCGCTGCAC CACGTGCTTG ATGACGGCGC CATTGACCTA CTGATTGACA AGCTCGAAGG CAAGCCTGGC GCACAATTCG TTGTATCGCG CGCGTCGCAA TTGCACACTA TCGAGGATTT GCAAAAGCGC CTTCAGCTCA TTGAACAGGG CAAGCAATCT AACGTGATTA GCGCCGCGCT GGAAGACAGC GATCGCGACC GGCTCTCTCG CATTCTCAAT GCTATTGGCG ATCAATACGT GCAGCAGAAC GTCGAACGTA AGGCGGCAGA GGCACAGAAA ACACTTGTCT TCCTGAACGA GCAGTTGCCT GAGTTCAAGC GCCAACTCGA AGCCTCTGAA GACGCCTACA CGCGTTTTCG CAATAAGAAC GGCACCGTGG CTTTCGATGA AGAAGCCAAG GGCGTGCTGG CCCAAACCAT CGAACTGCAA ACGAAGCTTC TCGAAACCCA GCAGAAGCGC CGAGAACTGG CCGCGCGTTT TACCGACAGC AACACTCGCG TGAAAACTAT CGACGGGCAG ATCGCCGCTA TCGAAAAAGA AATCGGTGGT CTCAACGCCC GTGTGAGCCG CATGCCTACG CTGCAACAGG ATGCGCTGCG CCTCGAGCGC GATGTGCGGG TGAACAGCGG GCTGTATCAG TCGCTGCAGA ACAATGCGCT GCAGCTGCGT CTAGTTAAGG AAGGAAAGAC TGGGAACGTG CGCGTGCTCG ACAAAGCAGT GAAGCCTAAA CAGCCAGTCA AACCGCAAAA GCCACTGATT CTGGCGCTGG CGCTGGCGCT GGGCCTCCTC GTCAGCGGCG CGGCATTGGC AATCGTTCGA AGCCGCTTCT TCACAGGCAT CCAGGATCCA CAAGAAATCG AGGCCCACAC CGGTCTTTCG GTCTATTCAG TAGTGCCCTT TACGCCCGAA CAGACAGTGC TCGACCAAGG CGTTGCAACT GGCGCCAAAG GCATCCAGCT ACTGGCGGTG ACCCATCCGG ACAGCCCGCC GATCGAGGCG CTCCGCAGCT TGCGGATCGC GCTTCAGTTC GCCACGCTGG AAGCTGGCAA CAACCGCGTG CTCATCACGG GTGCAACGCC AGGCATTGGC AAGAGCTTTG TTTCGGGCAA CTTCGCGGCC ATCATGGCGC ATGCCGGCAA GCGCGTGCTC CTGATCGATG CCGACATGCG CAAAGGCCAC TTGAACAAGC AATTTGGTCT GCCGCGTGAC GGTGGTCTTT CGGAGCTGTT GGCTGGTGAG CTTTCGGCGC AACAGGCCAT TCGCGCGCAG GTGCTGCCCA ACCTCGACGT GCTGACCACC GGCAAGCTGC CGACCAACCC GGCCGACATG CTGATGTCGG AAACCTTCAT CCGCACGCTC GACATGCTCT CGGCGCAGTA TGAACTGGTC ATCATTGACA CCCCTCCGGT GCTGGTGGCT GCTGACACAG CTGCCGTGGC GCCATACATG GGCGCAGTGC TGCTGGTAGC CCGGGCCGAT CAAACCCAAC TTGGCGAACT CAACGAAAGC GCCAAGCGCC TCGCACATGC GGGCAAAGCG GTCAGCGGCG TGATCTTCAA CGGCATCGAC CTGACGCGGC GCCACTACGG TAGCCATGGC TACCGCTACG GTGGCTACAG GTACACCACG TACAAGTACA ACGAATAA
|
Protein sequence | MNTPAVNTSN AARIGQPVAP STPGAGDDNI HLSELIDIVL DHKWLVTAIT ALSLVLGLVY ILLSTPVYQS NLLVQVEDAA ADAKGFLGET STLFDVKTPA TGEIQVIRSR MVIGAAVDQT RAYIEAKPNY LPILGPWLAR RASGLSEPGF LGMSGYVSGK EKIDVARFDV PPALEDEEPF VLTAQGQGKY TLSHDALDQP LAGTVGLPLH HVLDDGAIDL LIDKLEGKPG AQFVVSRASQ LHTIEDLQKR LQLIEQGKQS NVISAALEDS DRDRLSRILN AIGDQYVQQN VERKAAEAQK TLVFLNEQLP EFKRQLEASE DAYTRFRNKN GTVAFDEEAK GVLAQTIELQ TKLLETQQKR RELAARFTDS NTRVKTIDGQ IAAIEKEIGG LNARVSRMPT LQQDALRLER DVRVNSGLYQ SLQNNALQLR LVKEGKTGNV RVLDKAVKPK QPVKPQKPLI LALALALGLL VSGAALAIVR SRFFTGIQDP QEIEAHTGLS VYSVVPFTPE QTVLDQGVAT GAKGIQLLAV THPDSPPIEA LRSLRIALQF ATLEAGNNRV LITGATPGIG KSFVSGNFAA IMAHAGKRVL LIDADMRKGH LNKQFGLPRD GGLSELLAGE LSAQQAIRAQ VLPNLDVLTT GKLPTNPADM LMSETFIRTL DMLSAQYELV IIDTPPVLVA ADTAAVAPYM GAVLLVARAD QTQLGELNES AKRLAHAGKA VSGVIFNGID LTRRHYGSHG YRYGGYRYTT YKYNE
|
| |