Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_00180 |
Symbol | lysM |
ID | 7758986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 20470 |
End bp | 21495 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643802945 |
Product | peptidoglycan-binding LysM protein |
Protein accession | YP_002797261 |
Protein GI | 226942188 |
COG category | [S] Function unknown |
COG ID | [COG1652] Uncharacterized protein containing LysM domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0309659 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAAAT CACTACTCGC CCTGCTGCTG GCGGCCTGCA GCGCTCCGGC GCAGGCCGAG GTGCAGCTTC GCGCGGGTCA TCCGCAGAGC TACAGGGTCG TCCAGGGCGA TACCCTGTGG GGCATCTCCG GCAAGTTCCT CCGTGCGCCC TGGCAGTGGC CGCAGCTCTG GCACGTCGAC CCGCGGCTCG AGAATCCCCA CCTGATCTAC CCGGGCGACG TGCTGAACCT GGTCTACATC GACGGTCAGC CGCGCCTGAC GCTCGATCGC GGCGCCAGCC GCGGCACCGT CAGGCTGTCG CCGCGGGTCC GTCGCACGCC CACGGTCCAG GCCATCCCGA CCATTCCGCT GGAGAAGATC GACAGCTTCC TGCTCGGCAA CCGCATCGTC GACGACGCCG CGCAGTTGCG GAAGGCGCCC TACGTGGTCG CCGGCAATGC CGAGCGGGTG ATCAGCGGCG CTGGCGACCG GGTCTACGCG CGCGGCGACT TTTCCGCCGG CGAGGCGGCC TACGGCATCT TTCGCCAGGG CAAGGCCTAC GTCGACCCGG CGACCAGGGA AGTGCTCGGC ATCGACGCCG ACGACATCGG CACCGGCGAA CTGGTCGCCG AGGAAGGCGG CCTCGCCACC CTGCTGCTCA GCCGCTCGAC CCAGGAAGTG CGCATCGGCG ACCGCCTGCT CCCCAGCGAG GAGCGGGCCG TCGACTCCAC CTTCATGCCC AGGGAGCCGA ACGTCCCGGT CGAGGGGGTG ATCCTCGACG TGCCGCGCGG CGTGACCCAG GTCGGCAATT TCGCCGTGGT CACCCTGAAC AAGGGCAGTC GGGACGGGCT GGCCGAGGGC GACGTGCTGG CGGTGTACAA GACCGGCGAA ACCGTGCGCG ACCGCATCGG CGGCGAGCCG GTGAAGATTC CCGATGAGCG TTCCGGCCTG CTGATGGTGT TCCGCACCTA CGCGCGGCTC AGCTACGGCC TGATCCTCAG TGCCAGTCGC CAACTGGCGG TGATGGACAA GGTGCGCAAC CCGTAA
|
Protein sequence | MRKSLLALLL AACSAPAQAE VQLRAGHPQS YRVVQGDTLW GISGKFLRAP WQWPQLWHVD PRLENPHLIY PGDVLNLVYI DGQPRLTLDR GASRGTVRLS PRVRRTPTVQ AIPTIPLEKI DSFLLGNRIV DDAAQLRKAP YVVAGNAERV ISGAGDRVYA RGDFSAGEAA YGIFRQGKAY VDPATREVLG IDADDIGTGE LVAEEGGLAT LLLSRSTQEV RIGDRLLPSE ERAVDSTFMP REPNVPVEGV ILDVPRGVTQ VGNFAVVTLN KGSRDGLAEG DVLAVYKTGE TVRDRIGGEP VKIPDERSGL LMVFRTYARL SYGLILSASR QLAVMDKVRN P
|
| |