Gene Information |
Locus tag | Arth_1555 |
Symbol | |
ID | 4445922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 1733728 |
End bp | 1734927 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639689370 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_831049 |
Protein GI | 116670116 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) |
TIGRFAM ID | |
| |
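The length fields above follow from the coordinates, assuming Start bp and End bp are 1-based and inclusive and that the CDS includes its terminal stop codon. A minimal sketch of that arithmetic (the function names are illustrative and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


print(gene_length(1733728, 1734927))  # 1200
print(protein_length(1200))           # 399
```

Both values agree with the Gene Length and Protein Length fields of this record.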
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.212463 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCGGCGA CCCTCGCGGC TGCAATACAG GCCCAGGAAG CCGCCCTGAA GGCCGGAGTC ATTCCGGCAG CCTCGGTTTC GACGGCCCTC CCCGCATCCC TGCGGCCCGC CCAGCCCTCC GCACCGGCCG AATACACCAT TGCCCGCGGT GACACCATCA GCGGCATTGC GGGTCGCTAC GGCCTGGACA CCAACGCCAT CCTGAAGCTG AACAACCTGC AGGCGAACAC CATCATTTAT CCGGGGCAGA AAATTAAGCT GACCGGTTCG GCGGCGGCCC CCTCCGCTCC CGCGGCCAAG CCGGAGGCTC CCGCTCCTGC CAGCACCGCC GGCAGTGTCT ACACCGTTAA GTCCGGCGAC ACGCTGGGCG CCATTGCGGC ACGGCACGGC GTCAAGTTGT CCGAGGTGTT GAGCTGGAAC GGCCTCAATA TGAACTCCAT CATCTACCCG GGCCAGAAGA TCAAGATCGG CAGCGGCCAG GCACCGGCTC CCGCGCCGGC GGCTACCCCC GCTCCGGCAC CGGCTCCGGC GGCCAACTCG GGCTCCTACA CCGTGAAGTC AGGCGATACC TTGTCCGCCA TCGCCGCAAA GCACGGGGTC AAGCTGTCAG ACATCCTGTC CGCGAACAAG CTGACCATGA CCAGCGTTAT CTTCCCGGGC AACAAGCTGG TCATCCCCGG CGCCTCGATC CAGCCGGCAG CCAGCGTTAC TCCCCTGGTG CCGAGTTCCT TCCTCGGATT CACGTACCCG GCTGCCGTGG TCTCCTCCGC CAACCAGAAC AAGGCACTGC TCAACTCGTC GCCCGTGCCC ACAAGGGAGC AGATGAAGTC GATCGTCGCG GACACCGCGC GTCGGATGGG CGTGGATCCC TCGCTTGCCC TTGCCTTCGC CTACCAGGAG TCAGGCTTCG ACCAGCGCGC AGTCTCGCCG GCCAACGCCA TCGGCACCAT GCAGGTCATC CCGACGTCGG GCGAATGGGC CTCTGATCTC GTGGGCCGCA AGTTGAACCT GCTCGATCCC TACGACAACG CAACCGCAGG CGTGGCCATC ATCCGCCAGC TGATCCGCAC CAGCAAGGAC TTGGACAACG CCATCGCCGG CTACTACCAG GGCCAGTACT CCGTCAGCAA GAACGGCATG TTCGACGACA CGAAGGCATA CGTCGCAGCG ATCAAGGCGC ACAAGAAGAA CTTCAGCTAA
|
Protein sequence | MPATLAAAIQ AQEAALKAGV IPAASVSTAL PASLRPAQPS APAEYTIARG DTISGIAGRY GLDTNAILKL NNLQANTIIY PGQKIKLTGS AAAPSAPAAK PEAPAPASTA GSVYTVKSGD TLGAIAARHG VKLSEVLSWN GLNMNSIIYP GQKIKIGSGQ APAPAPAATP APAPAPAANS GSYTVKSGDT LSAIAAKHGV KLSDILSANK LTMTSVIFPG NKLVIPGASI QPAASVTPLV PSSFLGFTYP AAVVSSANQN KALLNSSPVP TREQMKSIVA DTARRMGVDP SLALAFAYQE SGFDQRAVSP ANAIGTMQVI PTSGEWASDL VGRKLNLLDP YDNATAGVAI IRQLIRTSKD LDNAIAGYYQ GQYSVSKNGM FDDTKAYVAA IKAHKKNFS
|
| |
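The protein sequence can be related to the gene sequence by translating it with translation table 11, in which an alternative start codon such as GTG (the first codon of this gene) is read as methionine at the initiation position. The sketch below is a dependency-free illustration, not the pipeline that produced this record. The names `translate_cds` and `gc_fraction` are hypothetical, and `START_CODONS` lists only a subset of the start codons that table 11 permits.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Table 11 uses the same assignments as the standard code and differs
# in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of table 11 start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence given 5' to 3', stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGCCGGCGA CCCTCGCGGC TGCAATACAG"))  # MPATLAAAIQ
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.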