Gene Vapar_2981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_2981 
Symbol 
ID7972257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp3137555 
End bp3138592 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content65% 
IMG OID644793566 
Productsulfate ABC transporter, periplasmic sulfate-binding protein 
Protein accessionYP_002944867 
Protein GI239815957 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCTTC GCCGCGACTT TATCAAGCTT TCCCTGGGTG CCGGCGTGGC CGGTGCCATG 
GCCCTGACGG CATTGCCGTC GTTCGCGCAG GGCGCGGCCC CCGTGACGCT GCTCAACGTG
TCGTACGACC CGACGCGCGA GCTGTACGTC GACTACAACC GCGCCTTCGC CAAATACTGG
AAGGGCAAGA CCGGCCAGGA CGTGACCATC AAGCAGTCGC ACGGCGGCTC GGGCAAGCAG
GCCCGCTCGA TCATCGACGG CATCGATGCC GACGTTGCCA CTCTGGCGCT GGGCGGCGAC
ATCGACGCGC TCGCCACGCA CGGCGGCCTC GTCAAGGCCG ACTGGCAAAA GCGCCTGCCG
CAGAACTCGG CGCCCTACAC CTCGACCATC GTGTTCCTTG TGAAGAAGGG CAATCCCAAG
GGCCTGAAGG ACTGGGACGA CCTCGTGAAA CCCGGCGTGC AGGTGATCAC GCCCAACCCC
AAGACCTCCG GCGGCGCGCG CTGGAACTAC CTGGCTGCCT GGGAATTCGC CAAGCGCAAG
TACGGCAGCG ACGCCAAGGC CAAGGAATAC ATCGGCAGCC TGTTCAAGAA CGTTCCGGTG
CTCGATGCCG GCGCGCGTGG CGCCACCATC ACCTTCGTGC AGCGCGGCGT GGGCGACGTG
CTGCTGGCCT GGGAGAACGA AGCCTTCCTG GCGCTGAAGG AATTCGGCGC CGAGAAGTTC
GAGATCGTGG TGCCGTCGAT CTCGATCCTG GCCGAGCCCA CCGTGGCGGT GGTCGACAAG
GTGGTCGACA AGAAGGGCAC CCGCGCGGTG GCCGAGGAAT ACCTCAAGTA CCTGTATTCG
GACGAAGGCC AGGACATTGC GGGCCGCAAC TTCTATCGCC CGACCTCGGA AAAGGCCAAG
GCCAAGTACG ACAAGCAGTT TCCCAAGCTC ACGCTGGTGA CCATCGACCA GGCCTTCGGC
GGCTGGGCCA AGGCCGACAA GGAGCACTTT GCCGACGGCG CTTCGTTCGA CCAGATCTAC
ACGGCCAAGC AGAAGTAA
 
Protein sequence
MSLRRDFIKL SLGAGVAGAM ALTALPSFAQ GAAPVTLLNV SYDPTRELYV DYNRAFAKYW 
KGKTGQDVTI KQSHGGSGKQ ARSIIDGIDA DVATLALGGD IDALATHGGL VKADWQKRLP
QNSAPYTSTI VFLVKKGNPK GLKDWDDLVK PGVQVITPNP KTSGGARWNY LAAWEFAKRK
YGSDAKAKEY IGSLFKNVPV LDAGARGATI TFVQRGVGDV LLAWENEAFL ALKEFGAEKF
EIVVPSISIL AEPTVAVVDK VVDKKGTRAV AEEYLKYLYS DEGQDIAGRN FYRPTSEKAK
AKYDKQFPKL TLVTIDQAFG GWAKADKEHF ADGASFDQIY TAKQK