Gene Vapar_5252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_5252 
Symbol 
ID7972685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp5573102 
End bp5574172 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content70% 
IMG OID644795846 
Productpeptidase M4 thermolysin 
Protein accessionYP_002947120 
Protein GI239818210 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0068319 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCTTC GCTCGCCGAG CCTTTTACCG CCCAGCTTCG TACCGCCGTA CCTGCTGGAC 
CGGCTGGCGC AACATGCCGG TGCGCATGCC AGCGCGAAGG CGGCGCAGAC GCTGATGATC
GACCTGCAGC ACCGCGGCCT GCGCGAGGCG GTGGCCGGGC AGGGCGTGTC GTCGGGCCCC
GCGCCCAGCT ATGTGCGGCG CGGCTCGCCC GCGCGCGCCA TCCACGACGC AGAGCACACC
ATGGTCCTGC CGGGGCGGCT GGTGCGTGCC GAAGGACAGG CCGCCACCGG CGACATCGCC
GCCGACGAGG CCTACGACTA CCTGGGCGCC ACCTACCGCC TGTACCACGA CATCTTCGAG
CGCGATTCCA TCGACGGCGC GGGCATGCCG CTCACGGGCA GCGTGCACTA CGGCAACGAC
TACGACAACG CCTTCTGGAA CGGCCAGCAG ATGGTGTTCG GCGACGGCGA CGGCGAGGTC
ATGAACCGCT TCACCATCGC GGTGGACATC ATCGGCCACG AGCTCACGCA CGGCGTGATC
GACCACGAGT CGGGCCTGGT CTACCAGGGG CAGCCGGGCG CGCTCAACGA GTCGATCTGC
GACGTGTTCG GCGCGCTGGT CAAGCAGCAC CTGCTCAAGC AGACCGCGCA GCAGGCCGAC
TGGCTGGTCG GCGCGGGGCT CTTCACCGGC AAGGTCAAGG CGCGCGCACT GCGCTCGATG
GCCGAGCCCG GCACGGCCTA CGACGACCCG GTGCTCGGCA AGGACCCGCA GCCTGCCCAC
ATGAAGGACT TTGTCGACAC GCGCCAGGAC AATGGCGGCG TGCACATCAA TTCCGGCATT
CCGAACCGCG CCTTCCATCT CGCTGCCACG GCCATCCAGG GCCCGGCATG GGAGACGGCC
GGGCGCGTCT GGTACGACAC GGTGTGCGAC CGGCGGCTGC GCCAGGACGC CGATTTCCTG
GCCTTCGCGC AGCTCAGCGT GGAAAATGCA GCCAGGCGCT TCGGCGCCGG CAGCGCGGCG
CACCAGGCCG TCGGCGCTAC ATGGAACACC GTGGGAGTCA CACCATCATG A
 
Protein sequence
MPLRSPSLLP PSFVPPYLLD RLAQHAGAHA SAKAAQTLMI DLQHRGLREA VAGQGVSSGP 
APSYVRRGSP ARAIHDAEHT MVLPGRLVRA EGQAATGDIA ADEAYDYLGA TYRLYHDIFE
RDSIDGAGMP LTGSVHYGND YDNAFWNGQQ MVFGDGDGEV MNRFTIAVDI IGHELTHGVI
DHESGLVYQG QPGALNESIC DVFGALVKQH LLKQTAQQAD WLVGAGLFTG KVKARALRSM
AEPGTAYDDP VLGKDPQPAH MKDFVDTRQD NGGVHINSGI PNRAFHLAAT AIQGPAWETA
GRVWYDTVCD RRLRQDADFL AFAQLSVENA ARRFGAGSAA HQAVGATWNT VGVTPS