Gene Cag_1379 details


Gene Information

Locus tag: Cag_1379
Symbol:
ID: 3747640
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1842366
End bp: 1844036
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 47%
IMG OID: 637773915
Product: peptidoglycan-binding LysM
Protein accession: YP_379680
Protein GI: 78189342
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains)
TIGRFAM ID:
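
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1842366
end_bp = 1844036
gene_length_bp = 1671
protein_length_aa = 556


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 1844036 - 1842366 + 1 gives 1671 bp, and 1671 / 3 - 1 gives 556 aa, matching the listed values.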


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGTGGC TGCCCACAGC ACTGCCGCTT GAGGCTGCTG AGCCTGCTCG CAATAACCCC 
AATGCGTTGC GGCGCTCCTC TATTTCGGAT GTTCTCGATA GCCTTGTAAA CGCTACCTAT
TTTAAGGATG AGTACTTTAC CGCACCGTCG CGTGAAGGTG GCGTAAGTTT TCCATCAACC
TTTGTGCCTC AATTTAGCGA TTCCGTTTAT AGCTCACGTA TTGCGGCACT TCGGCGCAAA
ACACCCATGC CACTTGTTTA TAACGCTCAA GTGAAAGGGT ACATTCGCAT GTATGCCGTT
GAAAAGCGTA GCTATACCGC TAAAATTTTA GGTTTAACAA AAATTTATTT TCCTCTCTTT
GAAGAGAAGT TTGATACCTA CAATGTGCCG CTTGAAATGA AATACTTGGC GATTGTGGAA
TCGGCACTGA ATCCCACAGC CGTCTCGCCA GCAGGTGCAA AAGGGTTGTG GCAGTTTATG
TATGGCACGG GTAAAATGTA TGGCTTGGAG TCCTCCTCGT TTATTGAAGA TCGTTACGAT
CCTTATAAAT CAAGCGTTGC AGCGGCTCGC CATTTGCGCG ATTTATATCA AATTTATGGT
GATTGGTTTT TAGCCCTTGC CGCTTACAAT GCTGGTCCTG GTAACGTTAA TAAAGCAATT
CGTCGCGCTG GTGGTGTAAA AGATTATTGG GCAATTTGGG ACTATCTGCC AGCCGAAACA
CGAGGCTACG TGCCTGCCTT TATAGCAGTT CACTACATTA TGAGCTACCA TAATGAGCAC
AACATTCGTC CGCTTGAACC AGCCTATCTC TATCGCGATA TTGATACGTT GCGTACCTCA
CGCATGGTTA CCTTTGAGCA AATCAGCGAA ACGCTTGGTA TTTCAGCCTC CGATTTAGAG
TTCCTTAATC CGCAATACAA AATTGGAGTG ATTCCTGCCT CTACGGGCAA TGGTAACGTT
ATTCGGTTGC CTCGTCGGTA TGTAGCCCAA TTTCAACGCC GCGAACAAGA AATTTATGCT
TATCGCTCCG CTCGCACTAT GGAGCGTGAA GCGCTCTATG CTCGGCTTGA AAGTGTGCGA
GCTGGTGCAG GTGAGCAAAG TAGTGAAAGC AGTAAAGGCA TGGGGAACCA GAAAATTCAC
ATTGTACAGC GAGGTGAAAC GCTTGGCTCG GTTGCACGCT TGTATCGCAC CTACATTAGC
CAGCTTATTG CATGGAATAA TCTTGTAGAT GCTGATATTA TGGTTGGGCA ACGCTTAGTA
GTGTTTGGTG GCGAGGATAA TAGTCCAGTT GCTGCACCTG AGCCACCAAA AAGCACTGTT
CCACCGAAGG CTCCCCCTAT AGAGCGTCAA CCAACGGCAG CACCAGAAGT GCGTGCAGCG
GCTCCCCCCA AACGCATTGC TGTAACACGT TCAACCCAAA CGGTTACTCG CGATGAGTTG
GTAGCACTTA CTGAAACACC AACCGTGGCT ACCGATAACA CAAGCGCAAA GGCTGAGCCA
ATATTCCATG TAGTAGAGCC AGGGCAAACG CTGTTTGCAA TTGCTACACA ACGCAAGGTA
ACGGTGAATC AGCTCATGCT TTGGAACAAT CTTAAAAGTG TTCAGATTAA AGCAGGGCAA
AAGCTCATTG TTTCAAGCGA TGGGCAAAGT GGGCGCGATA ACAGTCAATA G
 
Protein sequence
MLWLPTALPL EAAEPARNNP NALRRSSISD VLDSLVNATY FKDEYFTAPS REGGVSFPST 
FVPQFSDSVY SSRIAALRRK TPMPLVYNAQ VKGYIRMYAV EKRSYTAKIL GLTKIYFPLF
EEKFDTYNVP LEMKYLAIVE SALNPTAVSP AGAKGLWQFM YGTGKMYGLE SSSFIEDRYD
PYKSSVAAAR HLRDLYQIYG DWFLALAAYN AGPGNVNKAI RRAGGVKDYW AIWDYLPAET
RGYVPAFIAV HYIMSYHNEH NIRPLEPAYL YRDIDTLRTS RMVTFEQISE TLGISASDLE
FLNPQYKIGV IPASTGNGNV IRLPRRYVAQ FQRREQEIYA YRSARTMERE ALYARLESVR
AGAGEQSSES SKGMGNQKIH IVQRGETLGS VARLYRTYIS QLIAWNNLVD ADIMVGQRLV
VFGGEDNSPV AAPEPPKSTV PPKAPPIERQ PTAAPEVRAA APPKRIAVTR STQTVTRDEL
VALTETPTVA TDNTSAKAEP IFHVVEPGQT LFAIATQRKV TVNQLMLWNN LKSVQIKAGQ
KLIVSSDGQS GRDNSQ
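
The sequences above are printed in blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step together with a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration uses only the first two blocks of the gene sequence rather than the full 1671 bp.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, copied from the listing above.
first_blocks = "ATGTTGTGGC TGCCCACAGC"
seq = clean_sequence(first_blocks)
print(len(seq), gc_percent(seq))
```

Running the same two functions on the complete gene sequence would allow a comparison with the listed gene length and GC content.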