Gene Information |
Locus tag | Vapar_1191 |
Symbol | |
ID | 7973360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 1300992 |
End bp | 1303244 |
Gene Length | 2253 bp |
Protein Length | 750 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644791787 |
Product | exopolysaccharide transport protein family |
Protein accession | YP_002943108 |
Protein GI | 239814198 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01005] exopolysaccharide transport protein family [TIGR01007] capsular exopolysaccharide family |
|
|
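The start, end, gene length, and protein length in the record above agree with one another if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here, and the function names below are hypothetical. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1300992, 1303244)
print(length)                  # 2253
print(protein_length(length))  # 750
```

Both values match the Gene Length (2253 bp) and Protein Length (750 aa) fields.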
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.113829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCAC CCCAGCAAGC CGCCCTGCCG ATGCCCGCGC TGGAAGAGGA GAACGACGGG TTCAAACTCG TCGAGTACCT GGACATCCTG ATCGACCACC GGTGGTTCAT TGCCATCGTG GTTGCCGTGG CCCTGCTGCT TGGCATGGCC TACGCCCTTT TCGGGCAGCC GATGTACGAA ACCAATGTGG CCGTGCAGGT CGAGGACTCG GGCAATTCGG CAGGCAGCTT CCTGGGCGAT GCGGCGTCGT CGCTGCTGAG CGTGAAGACG CCGGCCGCCG GCGAGATCGA GATCATCAAG TCGCGCGCCA TCCTCGGCCA GGCGGTGGAG AACACCAAGC TATACATCAG CGCGCAGCCG CGCTATGCAC CCATCGTGGG CAGCTGGCTC GCGCGGCGCG CCACCGAGCT GTCGAACCCG GGCTTCATGG GGCTTTCCGG CTACGTGACG GGGGCCGAGG CGATCATGGT TCCGCAATTC GACGTCCCCG CGGATCTCGA AGCGAAGCCG TTCATGCTGA CCCTCGGGCC GGACAACCGC TACGAACTGC TCGTTCCCAA CGTCGACACG CCGCTCAAGG GCACCGTCGG AACCCCGCTC GTGGCCAACG TTCCGGGCGG CAGCCTCCGG CTGCTGGTCA GCTCCATCAG TGCCAAGCCC GGCGCGCAGT TCCAGCTGGC CCGCAACTCC AGGCAACTGG CGCTGCTGTC GCTGCAGGAC AACCTCAAGG TCGTCGAGAA AGGCAAGCAG TCCGGCGTTC TCGACGTCAG CCTGAAAAGC CCGGATCCGG AAAAGCTGAC GCAGCTGCTC AATGAAATCG GGCGGCTCTA TGTGCGCCAG AACATCGAGC GCAAGGCTGC CGAGGCCGAG AAGACCCTGG GCTTCCTGGA CACCGCGCTG CCGCAGTTCA AGAAGCAGCT CGAGCAGTCC GAAGACCTCT ACAACCGCTA TCGAAACGAG AACGGCACCG TCAGCCTCGA CGACGAGGCC AAGAATGCGC TGGCCCAGAC GGTCGACCTG CAGTCCAAGC TGCTCGAAGC CGAACAGAAG CGGCGGGAGC TTTCGGCGCG CTTCACCGAC AAGCACCCGA ATCTCCAGAC GCTGGACGCG CAGATTTCGG CCTGGCGGAG CCAGATCTCG GCGGTCGATT CGCGCATCCG CAAGATGCCG CTGCTGCAGC AGAACACCGT TCAGATGCAG CGCGACATCA AGGTCAACAC CGACCTCTAC GTGTCGCTCC TGAACAGCTC GCTGCAGATG CGGCTGGCCA AGGAAGGCAA GGTCGGCAAC GTGCGCCTGC TGGACGACGC CATCATTCCC GAGGAGCCCG TCTGGCCCAA GCGGCCGCTG ATCATCGCCC TGGCGCTGCT GCTGGGCCTG GGGGCCGGCG TCGTGCTCGC CATTGCCAAG AACTCGCTGT TCGGCGGCAT CCGCAATCCG AGCGAGATCG AGATGCACAC GGGCCTCAAC GTGTACAGCA GCATCCCGTT GAGCCCGGCC CAGCGCACGA TCGACAAGAA CATCGAGAGC AGGGCGCCGG GAATGCACAT TCTCGCGCTC CAGCAATCGG AGGATCCCGC GGTCGAAAGC CTGCGCAGCC TGCGGACCGC ATTGCAGTTC GCCATGCTGG AGGCGCCCAA CAACCGCCTG CTCATCTCCG GCGCGACGCC GGGAGTCGGG AAGACCTTCG TCTCGGTCAA CTTCGCGGCC ATCACCGCTG CGTCAGGAAA GAAGGTCCTG CTGATCGACG CCGACCTGCG CAAGGGCCGG GTCAACCAGT TCTTCTCCCT TTCGCGCTCG 
TCCGGCCTTT CGGAATTGAT TGCCGGCACG CTCGGCTTCG AAAAAGCCAT TCGCTCGTCG ATCCTGCCGA ATCTGGACGT CATGACGACC GGCATGCTGC CGCCCAATCC GGCCGAATTG CTCATGAGCG ATTCTTTTTC CCAGATCCTG GAAAAGCTTT CGCCGGATTA CGACCTTGTG ATTATCGATA CCGCGCCGGT GCTGGTGGCG GCGGACACCG CATCGGTGGC GCCGCTCGCA GGCTCTCTTC TGCTCGTTGC CCGGGCCGAA AAGACGCACC TGGGCGAATT GAACGAAAGC GTCAGAAGAC TGGCGCACGC GGGCTGCTCG GCCAATGGCG TAATTTTGAA TGCTATGGAC TTATCCCGGC GCCATGCGGG CAGCAGCAGC TACAAATACG GTGGTTACCG TTACACACAC TATAAATACA AAAACAACAG GGATACCACC TAG
|
Protein sequence | MNAPQQAALP MPALEEENDG FKLVEYLDIL IDHRWFIAIV VAVALLLGMA YALFGQPMYE TNVAVQVEDS GNSAGSFLGD AASSLLSVKT PAAGEIEIIK SRAILGQAVE NTKLYISAQP RYAPIVGSWL ARRATELSNP GFMGLSGYVT GAEAIMVPQF DVPADLEAKP FMLTLGPDNR YELLVPNVDT PLKGTVGTPL VANVPGGSLR LLVSSISAKP GAQFQLARNS RQLALLSLQD NLKVVEKGKQ SGVLDVSLKS PDPEKLTQLL NEIGRLYVRQ NIERKAAEAE KTLGFLDTAL PQFKKQLEQS EDLYNRYRNE NGTVSLDDEA KNALAQTVDL QSKLLEAEQK RRELSARFTD KHPNLQTLDA QISAWRSQIS AVDSRIRKMP LLQQNTVQMQ RDIKVNTDLY VSLLNSSLQM RLAKEGKVGN VRLLDDAIIP EEPVWPKRPL IIALALLLGL GAGVVLAIAK NSLFGGIRNP SEIEMHTGLN VYSSIPLSPA QRTIDKNIES RAPGMHILAL QQSEDPAVES LRSLRTALQF AMLEAPNNRL LISGATPGVG KTFVSVNFAA ITAASGKKVL LIDADLRKGR VNQFFSLSRS SGLSELIAGT LGFEKAIRSS ILPNLDVMTT GMLPPNPAEL LMSDSFSQIL EKLSPDYDLV IIDTAPVLVA ADTASVAPLA GSLLLVARAE KTHLGELNES VRRLAHAGCS ANGVILNAMD LSRRHAGSSS YKYGGYRYTH YKYKNNRDTT
|
| |
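The protein sequence above can be related to the gene sequence by codon-wise translation. The sketch below is illustrative only: it builds a codon table from the standard amino-acid assignments, which to my understanding are also the ones used by translation table 11 (that table differs from the standard code in its permitted start codons, which are not modelled here). The names `CODON_TABLE` and `translate` are hypothetical, and only short excerpts of the gene sequence are translated.

```python
from itertools import product

# Codons enumerated in T, C, A, G order (first base varies slowest),
# paired with the standard amino-acid assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a plus-strand DNA string, stopping at the first stop codon."""
    # Drop the spaces that split the sequence into blocks of ten bases.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGAACGCAC CCCAGCAAGC CGCCCTGCCG"))  # MNAPQQAALP
# Last 12 bases of the gene sequence, ending in the TAG stop codon.
print(translate("GATACCACCT AG"))  # DTT
```

The two outputs correspond to the first ten and last three residues of the listed protein sequence.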