Gene Vapar_0158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_0158 
Symbol 
ID7971694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp159557 
End bp161110 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content70% 
IMG OID644790761 
Producthistidine ammonia-lyase 
Protein accessionYP_002942087 
Protein GI239813177 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAACAA GCAAACACAC CGCCACCCCC TTGATCCTCA CGCCCGGCAA GGTGGACCTT 
GCCATGCTGC GCCGCATCCA GGCCGGCGGC GTGCGGCTGG CGCTCGATCC TTCGGTGCAG
GAGGGCATGG CGCGCGCCGA AGCGGCTGTG CGCCACATCG TCGAGAACGA CCAGGTGGTC
TACGGCATCA ACACCGGCTT CGGCAAGCTC GCGAGCACGC GCATCGGCAA CGACCACCTG
GCCGAGCTGC AGCGCAACCT CGTGCTCTCG CACAGCGTGG GCACGGGCGA GCCGCTGGCC
GCGCCGGTGG TGCGCATGGT GCTCGCGACC AAGGCCGTGA GCCTGGCGCG CGGCCACTCG
GGCGTGCGGC CCGCGCTGGC CGAGGCGCTG CTGGCGCTGT TCAATGCGGG CGTCATGCCG
CGCATTCCGT GCAAGGGCTC GGTCGGCGCC TCGGGCGACC TCGCGCCGCT CGCGCACATG
GCCTGCGTGC TGATCGGCGA GGGCGAGGCC ACCACGGCCG ACGGCGCCGT GGTCAGCGGC
GCCGAAGCCA TGCGCCTCGT CGGCCTCGAA CCCTTTGTGC TCGGCCCCAA GGAAGGCCTG
GCGCTGCTCA ACGGCACGCA GGTGTCGACC GCGCTCGCGC TCGCCGGCCT GTTCGGCGCG
GAGGACGTGT TCGCTTCGGC GCTGATGTCG GGTGCGCTCT CGCTCGAAGC CATCCAGGGT
TCGATCAAGC CCTTCGATGC GCGCATCCAT GCCGCGCGCG GCCAGCCGGG GCAGATCGCG
GTGGCGGGCG CGGTGCGCAC GCTGCTCGAA GGCAGCGAGA TCGTCCCTTC GCACGCCGAC
TGCGGCCGCG TGCAGGACCC GTATTCGGTG CGCTGCATTC CGCAGGTGAT GGGCGCCTGC
CTCGACAACC TCGCGCATGC CGCGCGCGTG CTGGTGATCG AGGCCAATGC CGCCTCGGAC
AACCCGCTGG TGTTCACCGA CACCGGCGAA GTGATCTCGG GCGGCAACTT CCACGCCGAG
CCGGTGGCCT TTGCGGCCGA CATCATTGCG CTGGCAGTGA GCGAAGTGGG CGCGATTGCC
GAGCGCCGCA TCGCGCTGCT GCTCGACACC GGCCTGTCGG GCCTGCCGCC GTTCCTGGTG
CGCGATGGCG GCCTGAACTC GGGCTTCATG ATCGCGCAGG TCACGGCCGC GGCGCTGGCG
TCGGAGAACA AGTCGCTCGC GCATCCCGCG AGCGTCGACA GCCTGCCCAC TTCGGCCAAC
CAGGAAGACC ACGTGTCGAT GGCCACCTTC GCGGCGCGCC GGCTCGGCGA CATGGTCAAC
AACACGGCGG TGGTCGTCGG CATCGAGGCG ATGGCCGCGG CACAAGGCAT CGAACTCAAG
CGGGGGCTCA AGAGCTCCCC GCTGGTCGAA GCCGAATTCG CCGCCATCCG CCAGAAGGTC
GCTTTTCTCG AACGCGACCG CTACCTTGCG CCCGACATCG AAGCGATGCG CCAGTGGGCG
CTGAAGGCCG AGCTGCCGGC CGCGCTCTTG AACATCCTGC CCAGCCACGC CTGA
 
Protein sequence
MPTSKHTATP LILTPGKVDL AMLRRIQAGG VRLALDPSVQ EGMARAEAAV RHIVENDQVV 
YGINTGFGKL ASTRIGNDHL AELQRNLVLS HSVGTGEPLA APVVRMVLAT KAVSLARGHS
GVRPALAEAL LALFNAGVMP RIPCKGSVGA SGDLAPLAHM ACVLIGEGEA TTADGAVVSG
AEAMRLVGLE PFVLGPKEGL ALLNGTQVST ALALAGLFGA EDVFASALMS GALSLEAIQG
SIKPFDARIH AARGQPGQIA VAGAVRTLLE GSEIVPSHAD CGRVQDPYSV RCIPQVMGAC
LDNLAHAARV LVIEANAASD NPLVFTDTGE VISGGNFHAE PVAFAADIIA LAVSEVGAIA
ERRIALLLDT GLSGLPPFLV RDGGLNSGFM IAQVTAAALA SENKSLAHPA SVDSLPTSAN
QEDHVSMATF AARRLGDMVN NTAVVVGIEA MAAAQGIELK RGLKSSPLVE AEFAAIRQKV
AFLERDRYLA PDIEAMRQWA LKAELPAALL NILPSHA