Gene Vapar_1407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_1407 
Symbol 
ID7971235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp1528680 
End bp1529786 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content70% 
IMG OID644792005 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_002943324 
Protein GI239814414 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCCG GGCCCGCCCG CACCCACGCG CCGCTGATCA TCGACGTGGC GGCGACCGAA 
CTGAACGACG CCGACCGTCG GCGCATCGCC AATCCGCTGG TCGGCGGCGT GATCCACTTC
GCCCGCAACT GGCAGGACCG TGCGCAGATG AGCGCGCTCA ATGCCGAGAT CAAGGCGATC
CGCCCCGACC TGCTGATCTG CGTGGACCAC GAAGGCGGCC GGGTGCAGCG CTTTCGCACC
GACGGCTTCA CGCGCCTGCC CTCGATGCGC GCGCTCGGCG AGCTGTGGAT GCGCGATGCG
ATGCGCGCCA CGCAGGCCGC AACGGCCGCC GGCCAGGTGC TTGCCGCCGA ACTGCGCGCC
TGCGGCGTCG ACTTCAGCTT TGCGCCGGTG CTCGACCTCG ATTACGGCGG CAGCAGCGTG
ATCGGCGACC GCAGCTTCCA CCGCGACCCG CGCGTGGCGG CGCTGCTGGC CAAGAGCCTG
ATGCACGGCA TGCTGCAGAT GGGCATGCGC AACTGCGGCA AGCACTTTCC GGGGCACGGC
TTCGTCACCG CCGATTCGCA TGTGGAAATC CCGGTCGACC GGCGCAGCCT CAAGGCCATT
CTGGCCGACG ATGCGCGGCC CTACGACTGG CTCGCGGGCA CGCTCACCGC AGTGATGCCG
GCGCACGTGA TCTATCCGAA GGTCGACAAG CGGCCCGCGG GCTTTTCTTC CAGGTGGCTG
GAGGACATCC TGCGCAGCAG GCTCGGTTTC GATGGCGCCA TCTTCAGCGA TGACCTGAGC
ATGGAAGCAG GGCGCTACAT CGACGGCGAG CTGCTGAGCT ATGCCGATGC CGCGCTGGCG
GCACTCGACG CGGGCTGCGA CCTGGCGATG CTGTGCAACC AGAGCATCGG CGATGGCCGC
CCGCTCGACG AACTGCTCGA CGGCTTTGCC GCCGCGGCCA GTGCGGGGCG CTGGCAGCCC
GATGCGAACA GCGAGGCGCG CCGCCGGGCG CTGCTGCCGG AAACGCCGCC GCTCGGCTGG
AACGCGCTCG CCGATTCGGC GGGCTATCGG CAGGCGAGGC AAGTACTTGC GCAGGCCGGG
CTTGCGGCGC CTCGACAGGG CGCCTGA
 
Protein sequence
MTAGPARTHA PLIIDVAATE LNDADRRRIA NPLVGGVIHF ARNWQDRAQM SALNAEIKAI 
RPDLLICVDH EGGRVQRFRT DGFTRLPSMR ALGELWMRDA MRATQAATAA GQVLAAELRA
CGVDFSFAPV LDLDYGGSSV IGDRSFHRDP RVAALLAKSL MHGMLQMGMR NCGKHFPGHG
FVTADSHVEI PVDRRSLKAI LADDARPYDW LAGTLTAVMP AHVIYPKVDK RPAGFSSRWL
EDILRSRLGF DGAIFSDDLS MEAGRYIDGE LLSYADAALA ALDAGCDLAM LCNQSIGDGR
PLDELLDGFA AAASAGRWQP DANSEARRRA LLPETPPLGW NALADSAGYR QARQVLAQAG
LAAPRQGA