Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_3356 |
Symbol | |
ID | 5454053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | - |
Start bp | 3594404 |
End bp | 3595453 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640878946 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_001414617 |
Protein GI | 154253793 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.681449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACGAACG GGGACGATAC GCCGGACGCG TCTGAGCAAG CCTCTGCGCC GAAGAAGTCG CGGTTGGGCC GCTATCTTCT GCTTTCCGCG CTCGTCCTGC CGCTCCTCGC TGCCCTTCTG GCTGCAAGTA TTTTTCTGTA CGGGAAATAC CGGTTCGAGG CGCACGGGCC GCATGAGGAA GCCGTCGTCG TCCTGCTTGC GCCCGGTACA GGCGTCCGCG CCATCGCGTC GCTGCTGGAC CGGGAAGGCG TTATTTCCGA CCCCATGATC TTCCTCGCCG GTGTCCGCTT CCACCGCGCG GAGGGAGACC TCAAGGCCGG CGAATACCGC ATACCCGCCC ACGCCAGCAT GGCCGCGATC ATGGGCATTC TGCGCGAAGG CCGCTCGATA CTTCACCGCA TCACCATCCC CGAAGGCTTG ACCAGCGAGC AGGCAATGCT GCTCGTCGCC GCCAATCCTG TGCTGCTCGG CGAGATGCCG CCCGTCCCCG CGGAAGGCAA AATACTGCCC GAGACCTACA GCTTCACGCG CGGCGCCACG CGGGCGGAAA TCGTTGCCGA GATGCAGAAA GCGGCGAGCG ACCTGCTGGA GCGCTTGTGG GAAGCCCGCG CCGAAAATCT GCCGGTCAAA ACGAAGGAAG AAGCGGTCAT TCTCGCATCC ATCGTGGAGA AGGAAACAGG CGTCGCTTCC GAGCGTCCCC GCGTCGCGGC CGTCTTCACC AATCGCCTGC GCAAGCCCAT GCGCCTCCAG TCCGACCCCA CGATCATCTA CGGTCTGGTC GGAGGCAAAG GCGCTCTGGG CCGTCCGATC CGCCGTAGCG AGCTCGACCG GCTGACCCCC TATAACACCT ATCTCGTGGA CGGCCTGCCG CCGACGCCCA TCTGCAATCC CGGCAAGGCC TCGCTCGAAG CGGTGCTCAA TCCCCCCGAT ACCGATGAGT TCTATTTTGT TGCCGATGGC ACCGGCGGCC ACGCCTTCTC CCGTACGCTG GCCGAACATT TGGAGCGGGT CCGTGAATGG CGGCAGATCG AGCGCCAGAA GGCGCAGTAG
|
Protein sequence | MTNGDDTPDA SEQASAPKKS RLGRYLLLSA LVLPLLAALL AASIFLYGKY RFEAHGPHEE AVVVLLAPGT GVRAIASLLD REGVISDPMI FLAGVRFHRA EGDLKAGEYR IPAHASMAAI MGILREGRSI LHRITIPEGL TSEQAMLLVA ANPVLLGEMP PVPAEGKILP ETYSFTRGAT RAEIVAEMQK AASDLLERLW EARAENLPVK TKEEAVILAS IVEKETGVAS ERPRVAAVFT NRLRKPMRLQ SDPTIIYGLV GGKGALGRPI RRSELDRLTP YNTYLVDGLP PTPICNPGKA SLEAVLNPPD TDEFYFVADG TGGHAFSRTL AEHLERVREW RQIERQKAQ
|
| |