Gene Arth_1555 details

Gene Information

Locus tag: Arth_1555
Symbol:
ID: 4445922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1733728
End bp: 1734927
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 65%
IMG OID: 639689370
Product: peptidoglycan-binding LysM
Protein accession: YP_831049
Protein GI: 116670116
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains)
TIGRFAM ID:
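
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1733728
END_BP = 1734927


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(START_BP, END_BP)
    print(length, protein_length_aa(length))  # prints: 1200 399
```

Under these assumptions the result agrees with the reported Gene Length (1200 bp) and Protein Length (399 aa).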


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.212463
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCGGCGA CCCTCGCGGC TGCAATACAG GCCCAGGAAG CCGCCCTGAA GGCCGGAGTC 
ATTCCGGCAG CCTCGGTTTC GACGGCCCTC CCCGCATCCC TGCGGCCCGC CCAGCCCTCC
GCACCGGCCG AATACACCAT TGCCCGCGGT GACACCATCA GCGGCATTGC GGGTCGCTAC
GGCCTGGACA CCAACGCCAT CCTGAAGCTG AACAACCTGC AGGCGAACAC CATCATTTAT
CCGGGGCAGA AAATTAAGCT GACCGGTTCG GCGGCGGCCC CCTCCGCTCC CGCGGCCAAG
CCGGAGGCTC CCGCTCCTGC CAGCACCGCC GGCAGTGTCT ACACCGTTAA GTCCGGCGAC
ACGCTGGGCG CCATTGCGGC ACGGCACGGC GTCAAGTTGT CCGAGGTGTT GAGCTGGAAC
GGCCTCAATA TGAACTCCAT CATCTACCCG GGCCAGAAGA TCAAGATCGG CAGCGGCCAG
GCACCGGCTC CCGCGCCGGC GGCTACCCCC GCTCCGGCAC CGGCTCCGGC GGCCAACTCG
GGCTCCTACA CCGTGAAGTC AGGCGATACC TTGTCCGCCA TCGCCGCAAA GCACGGGGTC
AAGCTGTCAG ACATCCTGTC CGCGAACAAG CTGACCATGA CCAGCGTTAT CTTCCCGGGC
AACAAGCTGG TCATCCCCGG CGCCTCGATC CAGCCGGCAG CCAGCGTTAC TCCCCTGGTG
CCGAGTTCCT TCCTCGGATT CACGTACCCG GCTGCCGTGG TCTCCTCCGC CAACCAGAAC
AAGGCACTGC TCAACTCGTC GCCCGTGCCC ACAAGGGAGC AGATGAAGTC GATCGTCGCG
GACACCGCGC GTCGGATGGG CGTGGATCCC TCGCTTGCCC TTGCCTTCGC CTACCAGGAG
TCAGGCTTCG ACCAGCGCGC AGTCTCGCCG GCCAACGCCA TCGGCACCAT GCAGGTCATC
CCGACGTCGG GCGAATGGGC CTCTGATCTC GTGGGCCGCA AGTTGAACCT GCTCGATCCC
TACGACAACG CAACCGCAGG CGTGGCCATC ATCCGCCAGC TGATCCGCAC CAGCAAGGAC
TTGGACAACG CCATCGCCGG CTACTACCAG GGCCAGTACT CCGTCAGCAA GAACGGCATG
TTCGACGACA CGAAGGCATA CGTCGCAGCG ATCAAGGCGC ACAAGAAGAA CTTCAGCTAA
 
Protein sequence
MPATLAAAIQ AQEAALKAGV IPAASVSTAL PASLRPAQPS APAEYTIARG DTISGIAGRY 
GLDTNAILKL NNLQANTIIY PGQKIKLTGS AAAPSAPAAK PEAPAPASTA GSVYTVKSGD
TLGAIAARHG VKLSEVLSWN GLNMNSIIYP GQKIKIGSGQ APAPAPAATP APAPAPAANS
GSYTVKSGDT LSAIAAKHGV KLSDILSANK LTMTSVIFPG NKLVIPGASI QPAASVTPLV
PSSFLGFTYP AAVVSSANQN KALLNSSPVP TREQMKSIVA DTARRMGVDP SLALAFAYQE
SGFDQRAVSP ANAIGTMQVI PTSGEWASDL VGRKLNLLDP YDNATAGVAI IRQLIRTSKD
LDNAIAGYYQ GQYSVSKNGM FDDTKAYVAA IKAHKKNFS
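
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG can serve as an initiation codon and is then read as methionine rather than valine. The following is a minimal sketch of that rule, not the database's own pipeline. The name `translate_cds` is hypothetical, and only GTG and TTG are handled as alternative starts here, although table 11 permits others.

```python
from itertools import product

# Codon-to-residue mapping in the usual T, C, A, G ordering.
# Table 11 uses the same residue assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the alternative initiation codons in table 11.
ALT_START_CODONS = {"GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate_cds("GTGCCGGCGA CCCTCGCGGC TGCAATACAG"))  # prints: MPATLAAAIQ
```

The output matches the first ten residues of the protein sequence listed above.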