Gene Vapar_0533 details

Gene Information

Locus tag: Vapar_0533
Symbol:
ID: 7972943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 593171
End bp: 594238
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 70%
IMG OID: 644791136
Product: pentapeptide repeat protein
Protein accession: YP_002942462
Protein GI: 239813552
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
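
The length fields in this record follow from the coordinates by simple arithmetic. A minimal sketch in Python, assuming the coordinates are 1-based and inclusive (the record does not state its convention, but the numbers agree with it) and that the final codon is a stop codon not counted in the protein length:

```python
# Coordinates copied from the record (Start bp / End bp).
start_bp = 593171
end_bp = 594238

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be a stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1068
print(protein_length)  # 355
```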


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCAGCG CACTGCTGAC TCCGAAACTG CTGGCACTCG CCGTGACGCT GGGCGAGCCG 
ATGGAAGACA AGGACTTTGG CGCCGGCGCC TATGCGCGCG CGTACCTGGC CGGCGGCGTG
TTCTCGCGCT GCCGGTTCGA CGAGGCCGAC CTGCGCGGCG CCGACCTGCG CGAGACGCTG
TTCGACCAGT GCAGCTTCAA GGGCGCGCTG ATGGACGGCG CCAGCCTGCG CCGCGCCGTG
CTCAACCGGT GCACCTTCGA CCATGCTGCG CTGCGCAGTG CCGACCTGCA CGGCGCGGTG
CTGACCGACT GCGCGCTGCC GCATGCGCAA CTGACCCAGG CTGTGCTGTC GATGGCAACC
GTCTCGAACT GCGACTTCGC CCATGCCGCG CTGATCGGAG CCGACCTGGA GTCTTCGACC
TTCACGCGGT CGAACCTGGT GGACGTCAAC GCGGACGACA GCCGCTGGGT CCACACCTCG
ATGCTCGCAT GCGGCTTGGC GCAAATGACC TGGGCGCGCG CCCGGATGCA GCGGGTCGTG
TTCCACGAGG TCGACCTGCA GGGGAAATCC TTCGCCGGCC TGTCCCTCGA CGGTTGCCAG
TTCGCGAACT GCAACCTCGG CGGCGCGAGC TTTCGCAACG CGCCGATGCG GCAATGCAAT
TTCCAGGGCG CGCGCCTCGA TCGTGTCGAC TTCTCCGGCG CGCAAGGGCC GATGGCCGTG
TTCTGCGATG CCCGGGGCGA AGCCGTCAAC TTCGCGGGTG CGGGGCTGCG GCAGGCGCTG
TTCACGCGCA GCATGCTGCC CGGCGCGCGC TTCGATGGCG CCGACCTGCA CCAGTGCCAC
TTTGCGGATG CGAAGCTGGC TGCCGCCTCG CTGCGCGACT GCGATCTGAG CTATGCCGAT
TTCAGCCGGG CCGACCTGCA AAAGGTCGAC GGCCGCGGCG CCACCCTGTT GCGCACCGTG
CTGCATCGCG CCGACACCGA GGACGCGCTG TGGACCGACC GCCCGCGGGC GCTGGAAACC
GACGCGGCGT TGGCGCGCGC GGAACTCTGG AGGGGGCCTG CGCCATGA
 
Protein sequence
MPSALLTPKL LALAVTLGEP MEDKDFGAGA YARAYLAGGV FSRCRFDEAD LRGADLRETL 
FDQCSFKGAL MDGASLRRAV LNRCTFDHAA LRSADLHGAV LTDCALPHAQ LTQAVLSMAT
VSNCDFAHAA LIGADLESST FTRSNLVDVN ADDSRWVHTS MLACGLAQMT WARARMQRVV
FHEVDLQGKS FAGLSLDGCQ FANCNLGGAS FRNAPMRQCN FQGARLDRVD FSGAQGPMAV
FCDARGEAVN FAGAGLRQAL FTRSMLPGAR FDGADLHQCH FADAKLAAAS LRDCDLSYAD
FSRADLQKVD GRGATLLRTV LHRADTEDAL WTDRPRALET DAALARAELW RGPAP