Gene Vapar_3117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_3117 
Symbol 
ID7974327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp3275422 
End bp3276351 
Gene Length930 bp 
Protein Length309 aa 
Translation table11 
GC content70% 
IMG OID644793702 
Productprotein TolA 
Protein accessionYP_002945002 
Protein GI239816092 
COG category[A] RNA processing and modification 
COG ID[COG5178] U5 snRNP spliceosome subunit 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain
[TIGR02794] TolA protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.321745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCTCG CACTGGATCG CCCCGAGTTC GCACCACCGC CGCAGCGCGG CACGCCGCGC 
GCGGTGCTGC TCGCGCTGGT TGCGCACGCG CTGCTGATTG CCGCGCTGAC GTGGGGAGTG
CGCTGGCGCA GCGATGCCGA CGAAGGCGCC GTCGATGCGG AGCTGTGGTC TTCCACGGTG
CAGCAGGCTG CGCCGCGCCT GTCGGCGCCG CAGGCACCCA CGCCCGCTCC CGCGCCTCCG
CCGCCTGCGC CTCCTCCGCC GCCACCGCCG CCTCAGGTGA AGGCCCCCGA GCCCGCACCG
CCGCCGCGCG CGCCGGACAT CGCCCTCGAG CGCGAGAAAA AGCTCAAGGA AGAAAAGGAA
CAGAAGGAGC GCGAGCTCGA GCGCCAGCAA CAGCAGCGCA AGAAGGAGCT CGAAGCCAAG
CAGCGCGCCG AAGACGAAGC CGAACGCAAG AAGGCGCAGC AGCAAAAGCT CGCCGAACAG
CAGAAGAAAC AGCAGCAGGA GGCCGAGGCC AAGCAGGCCG AAGCGAAGAA GCAGCAGGAA
GCCGCAGCCA AGCAGGCCGC GGCCGACCGC GCCGCAACGC TCAAGCGCAT GCAGGGCCTC
GCGGGCGCGA GCGGCAGCGA CGATTCCAAG GGCACGGCAA TGCGCTCGTC AGGACCGTCG
AGCGGCTACG CCGGACGCAT TGCCGCGGCG GTGCGCCCCA ACATCACCTT CCCCGATGCC
GAGACGGTCA ACGGCAATCC GGCGGCCGAG TTCGAGGTGA ACCTGGCACC GGACGGCACC
ATCGTCGGCG TCAAGCTGAC CAAGTCGAGC GGACTGCCCA GCTGGGACGA AGCCGCCGAA
CGCGGCCTGC ACAAGACCGA CAAGCTGCCG CGCGACACCG ACGGGCGCAT CTTCCCGTCG
CTGATCGTCT CGCTGCGGCC CAAGCGGTAG
 
Protein sequence
MSLALDRPEF APPPQRGTPR AVLLALVAHA LLIAALTWGV RWRSDADEGA VDAELWSSTV 
QQAAPRLSAP QAPTPAPAPP PPAPPPPPPP PQVKAPEPAP PPRAPDIALE REKKLKEEKE
QKERELERQQ QQRKKELEAK QRAEDEAERK KAQQQKLAEQ QKKQQQEAEA KQAEAKKQQE
AAAKQAAADR AATLKRMQGL AGASGSDDSK GTAMRSSGPS SGYAGRIAAA VRPNITFPDA
ETVNGNPAAE FEVNLAPDGT IVGVKLTKSS GLPSWDEAAE RGLHKTDKLP RDTDGRIFPS
LIVSLRPKR