Gene Vapar_1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_1940 
Symbol 
ID7971119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp2074756 
End bp2077023 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content59% 
IMG OID644792539 
Productexopolysaccharide transport protein family 
Protein accessionYP_002943853 
Protein GI239814943 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01005] exopolysaccharide transport protein family
[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACTC CTGCTGTGAA CACTTCTAAT GCAGCGCGCA TCGGTCAGCC AGTCGCGCCT 
TCTACTCCAG GAGCGGGCGA TGACAACATT CACTTGTCGG AACTAATCGA TATCGTGCTC
GACCATAAGT GGTTGGTCAC AGCCATCACC GCACTTTCGC TGGTGCTCGG TTTGGTCTAC
ATACTGTTGT CTACTCCCGT TTACCAATCC AACCTATTAG TCCAGGTAGA GGATGCGGCG
GCTGATGCCA AGGGTTTTCT CGGTGAAACC TCCACCCTTT TTGACGTAAA GACGCCCGCC
ACGGGTGAGA TCCAGGTTAT TCGTTCGCGG ATGGTAATCG GCGCAGCAGT GGACCAGACG
CGCGCCTATA TCGAAGCCAA ACCGAATTAC CTGCCGATAC TCGGCCCATG GCTGGCTCGG
CGGGCCAGCG GACTTTCGGA GCCCGGCTTC CTAGGTATGA GCGGCTACGT CTCCGGCAAA
GAGAAAATCG ATGTCGCACG TTTTGACGTG CCGCCTGCGC TCGAAGACGA AGAGCCCTTC
GTGCTTACCG CTCAGGGTCA AGGCAAGTAC ACGCTCAGTC ACGACGCGCT GGACCAGCCG
CTGGCCGGCA CCGTTGGCCT GCCGCTGCAC CACGTGCTTG ATGACGGCGC CATTGACCTA
CTGATTGACA AGCTCGAAGG CAAGCCTGGC GCACAATTCG TTGTATCGCG CGCGTCGCAA
TTGCACACTA TCGAGGATTT GCAAAAGCGC CTTCAGCTCA TTGAACAGGG CAAGCAATCT
AACGTGATTA GCGCCGCGCT GGAAGACAGC GATCGCGACC GGCTCTCTCG CATTCTCAAT
GCTATTGGCG ATCAATACGT GCAGCAGAAC GTCGAACGTA AGGCGGCAGA GGCACAGAAA
ACACTTGTCT TCCTGAACGA GCAGTTGCCT GAGTTCAAGC GCCAACTCGA AGCCTCTGAA
GACGCCTACA CGCGTTTTCG CAATAAGAAC GGCACCGTGG CTTTCGATGA AGAAGCCAAG
GGCGTGCTGG CCCAAACCAT CGAACTGCAA ACGAAGCTTC TCGAAACCCA GCAGAAGCGC
CGAGAACTGG CCGCGCGTTT TACCGACAGC AACACTCGCG TGAAAACTAT CGACGGGCAG
ATCGCCGCTA TCGAAAAAGA AATCGGTGGT CTCAACGCCC GTGTGAGCCG CATGCCTACG
CTGCAACAGG ATGCGCTGCG CCTCGAGCGC GATGTGCGGG TGAACAGCGG GCTGTATCAG
TCGCTGCAGA ACAATGCGCT GCAGCTGCGT CTAGTTAAGG AAGGAAAGAC TGGGAACGTG
CGCGTGCTCG ACAAAGCAGT GAAGCCTAAA CAGCCAGTCA AACCGCAAAA GCCACTGATT
CTGGCGCTGG CGCTGGCGCT GGGCCTCCTC GTCAGCGGCG CGGCATTGGC AATCGTTCGA
AGCCGCTTCT TCACAGGCAT CCAGGATCCA CAAGAAATCG AGGCCCACAC CGGTCTTTCG
GTCTATTCAG TAGTGCCCTT TACGCCCGAA CAGACAGTGC TCGACCAAGG CGTTGCAACT
GGCGCCAAAG GCATCCAGCT ACTGGCGGTG ACCCATCCGG ACAGCCCGCC GATCGAGGCG
CTCCGCAGCT TGCGGATCGC GCTTCAGTTC GCCACGCTGG AAGCTGGCAA CAACCGCGTG
CTCATCACGG GTGCAACGCC AGGCATTGGC AAGAGCTTTG TTTCGGGCAA CTTCGCGGCC
ATCATGGCGC ATGCCGGCAA GCGCGTGCTC CTGATCGATG CCGACATGCG CAAAGGCCAC
TTGAACAAGC AATTTGGTCT GCCGCGTGAC GGTGGTCTTT CGGAGCTGTT GGCTGGTGAG
CTTTCGGCGC AACAGGCCAT TCGCGCGCAG GTGCTGCCCA ACCTCGACGT GCTGACCACC
GGCAAGCTGC CGACCAACCC GGCCGACATG CTGATGTCGG AAACCTTCAT CCGCACGCTC
GACATGCTCT CGGCGCAGTA TGAACTGGTC ATCATTGACA CCCCTCCGGT GCTGGTGGCT
GCTGACACAG CTGCCGTGGC GCCATACATG GGCGCAGTGC TGCTGGTAGC CCGGGCCGAT
CAAACCCAAC TTGGCGAACT CAACGAAAGC GCCAAGCGCC TCGCACATGC GGGCAAAGCG
GTCAGCGGCG TGATCTTCAA CGGCATCGAC CTGACGCGGC GCCACTACGG TAGCCATGGC
TACCGCTACG GTGGCTACAG GTACACCACG TACAAGTACA ACGAATAA
 
Protein sequence
MNTPAVNTSN AARIGQPVAP STPGAGDDNI HLSELIDIVL DHKWLVTAIT ALSLVLGLVY 
ILLSTPVYQS NLLVQVEDAA ADAKGFLGET STLFDVKTPA TGEIQVIRSR MVIGAAVDQT
RAYIEAKPNY LPILGPWLAR RASGLSEPGF LGMSGYVSGK EKIDVARFDV PPALEDEEPF
VLTAQGQGKY TLSHDALDQP LAGTVGLPLH HVLDDGAIDL LIDKLEGKPG AQFVVSRASQ
LHTIEDLQKR LQLIEQGKQS NVISAALEDS DRDRLSRILN AIGDQYVQQN VERKAAEAQK
TLVFLNEQLP EFKRQLEASE DAYTRFRNKN GTVAFDEEAK GVLAQTIELQ TKLLETQQKR
RELAARFTDS NTRVKTIDGQ IAAIEKEIGG LNARVSRMPT LQQDALRLER DVRVNSGLYQ
SLQNNALQLR LVKEGKTGNV RVLDKAVKPK QPVKPQKPLI LALALALGLL VSGAALAIVR
SRFFTGIQDP QEIEAHTGLS VYSVVPFTPE QTVLDQGVAT GAKGIQLLAV THPDSPPIEA
LRSLRIALQF ATLEAGNNRV LITGATPGIG KSFVSGNFAA IMAHAGKRVL LIDADMRKGH
LNKQFGLPRD GGLSELLAGE LSAQQAIRAQ VLPNLDVLTT GKLPTNPADM LMSETFIRTL
DMLSAQYELV IIDTPPVLVA ADTAAVAPYM GAVLLVARAD QTQLGELNES AKRLAHAGKA
VSGVIFNGID LTRRHYGSHG YRYGGYRYTT YKYNE